WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12P 7/24, C12N 9AM), 15/60, 9/02, 9/88 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 97/35999 

2 October 1997(02.10.97) 



(21) International Application Number: PCT/GB97/00809 

(22) International Filing Date: 24 March 1997 (24.03.97) 



(30) Priority Data: 
9606187.4 



23 March 1996 (23.03.96) GB 



(71) Applicant (for ail designated States except US): INSTITUTE 
OF FOOD RESEARCH [GB/GB]; Norwich Research Park, 
Colney. Norwich NR4 7UA (GB). 

(72) Inventors; and 

(75) Inventoi^Applkants (for OS only): NARBAD, Arjan 
[GB/GB]; 61 The Avenues, Norwich, Norfolk NR2 3QR 
(GB). RHODES, Michael, John, Charles [GB/GB]; 35 
Brcttingham Avenue, Cringleford, Norwich, Norfolk NR4 
6XQ (GB). GASSON. Michael, John [GB/GB]; Old Ash 
Farm, Stone Lane, Bintree, nr East Dereham, Norfolk 
NR20 SNA (GB). WALTON, Nicholas, John [GB/GB]; 69 
City Road, Norwich, Norfolk NR1 3 AS (GB). 

(74) Agent: BASSETT, Richard; Eric Potter Clarkson, St. Mary's 
Court, St. Mary's Gate, Nottingham NG1 ILE (GB). 



(54) Title: PRODUCTION OF VANILLIN 
(57) Abstract 

A method of producing vanillin comprising the steps of* (1) 
providing fro/u-ferulic acid or a salt thereof; and (2) providing 
n-o^-femlate: CoASH ligase activity (enzyme activity I), trans- 
feruloyl SCoA hydratase activity (enzyme activity II), and 4-hydroxy-3- 
meAoxyphenyl-^-hydroxy-propionyl SCoA (HMPHP SCoA) cleavage 
activity (enzyme activity HI). Conveniently the enzymes are provided 
by Pseudomonas fluoresces Fe3 or a mutant or derivative thereof 
Polypeptides with enzymes activities II and HI and polynucleotides 
encoding said polypeptides. Use of said polypeptides or said polynu- 
cleotides in a method for producing vanillin. 



(81) Designated States: AU, CA, JP, NO, US, European patent (AT 
BE, CH. DE, DK. ES, R, FR, GB, GR, IE, IT, LU. fAc] 
NL, PT, SE). 



Published 

Without international search report and to be republished 
upon receipt of that report 



US 7\t ^3,011 



AN103 



OMe »»Mjr>jmoyfscoA 



H T i 

OMe 



CH-CKi-COSCoA 
OH 



CHjCOSCoA 



XT"" 

OMe 



OH 



nhgdwMQe 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


liCsotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Fid Land 


LT 


Lithuania 


SK 


Slovakia 


AT 


Aii&tru 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


CN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cote d*l voire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






CU 


Cuba 


KZ 


Kazakhstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singipore 







WO 97/35999 



PCT/GB97/00809 

1 

PRODUCTION OF VANII I TM 



The present invention relates principally to the production of vanillin (4- 

hydroxy-3-methoxy^^ 

other than by extraction from the Vanilla pod. 

Vanillin is an important food and drink flavouring agent and a major 
flavour component of natural vanilla from the Vanilla pod. The use of 
natural vanilla is limited by its high price. Synthetic vanillin, commonly 
derived from sulphite liquors produced during the processing of wood pulp 
for paper manufacture, is frequently used as a low-cost vanilla substitute. 
Alternative biological processes for the production of natural vanillin and 
allied flavourings would have considerable industrial value and utility, 
most particularly if such processes could facilitate the production of 
vanillin and/or allied flavourings directly in a fermented food or beverage. 

The mechanism of vanillin biosynthesis in Vanilla remains substantially 
uncharacterised. M.H. Zenk {Anal. Z. Pflanzenphysiol 53, 404-414 
(1965)) showed that vanillin was derived from /mn J -ferulate(4-hydroxy-3- 
methoxy-rra«,-cinnamat e ) and proposed a mechanism analogous to the 
classical 0-oxidation of fatty acids, with cleavage of a /J-keto thioester to 
produce acetyl SCoA and vanilloy. SCoA (4-hydroxy-3-meuioxyben2oyl 
SCoA) and subsequent reduction and CoASH release to generate vanillin. 
C. Funk and P.E. Brodelius {Plant Physiol. 94, 95-101; 102-108 (1990); 
99, 256-262 (1992)), proposed a different route, in which the 4-hydroxy 
group of mm*-ferulate became successively methylated and demethylated 
during the pathway of vanillin biosynthesis; however, the detailed 
enzymology was not elucidated. In potato tubers and in the fungus. 



Polyporus hispidus (C.J. French, CP. Vance and G.H.N. Towers, 
Phytochemistry 15, 564-566 (1976)), in cell cultures of Uthospermum 
erythrorhizon (K. Yazaki, L. Heide and M. Tabata, Phytochemistry 30, 
2233-2236 (1991)) and in cell cultures of carrot (J.-P. Schnitzler, J. 
Madlung, A. Rose and H.U. Seitz, Planta 188, 594-600 (1992)), evidence 
was obtained from in vitro studies that the corresponding analogue of 
vanillin, 4-hydroxybenzaldehyde, was an intermediate in the formation of 
4-hydroxybenzoate from 4-coumarate (4-hydroxy-rra/tf-cinnamate). There 
was no requirement for ATP or CoASH, thus apparently ruling out a 0- 
oxidation mechanism. Further studies with cell-free extracts of 
Uthospermum erythrorhizon, however, have in contrast recently 
established the presence of a /3-oxidation route for the conversion of 4- 
coumarate to 4-hydroxybenzoate (R. Ldscher and L. Heide, Plant Physiol. 
106, 271-279 (1994)); in this case, the conversion was dependent on ATP, 
Mg 2+ ions and NAD + and proceeded via 4-hydroxybenzoyl SCoA, without 
the intermediate formation of 4-hydroxybenzaldehyde. 

In the Gram-negative bacterium, Pseudomonas acidovorans, /ra/u-ferulate 
was shown to be catabolised to vanillate and acetate, apparently via 
vanillin (A. Toms and J.M. Wood, Biochemistry 9, 337-343 (1970)). 
Although in cell-free extracts NAD + was necessary for the oxidation of 
vanillin to vanillate and for the further oxidation of vanillate to 
protocatechuate and formate, no mention was made of any other cofactor 
requirements. Further studies of ferulate utilisation in Pseudomonas 
species have been reported (V. Andreoni and G. Bestetti, FEMS 
Microbiology Ecology 53, 129-132 (1988); T. Omori, K. Hatakeyama and 
T. Kodama, Appl. Microbiol. Biotechnol. 29, 497-500 (1988); Z. Huang, 
L. Dostal and J.P.N. Rosazza, Appl. Env. Microbiol. 59, 2244-2250 
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(1993)); however, these have not sought to elucidate further the 
mechanism of the two-carbon cleavage of ferulate. Zenk et al (1980) 
Anal. Biochem. 101, 182-187 describe a procedure for the enzymatic 
synthesis and isolation of cinnamoyl-CoA thioesters using a bacterial 
system. In contrast, the enzymology and genetics of the utilisation of 
simple benzene derivatives, including benzoic acids and phenols, by 
Pseudomonas have been intensively studied (T.K. Kirk, T. Higuchi and 
H.-M. Chang (eds.), Ugnin biodegradation\ CRC Press, Boca Raton 
Ha, USA (1980); D.T. Gibson (ed.), Microbial degradation of organic 
compounds^, Marcel Dekker, New York (1984); J.L. Ramos, A. 
Wasserfallen, K. Rose and K.N. Timmis, Science 235, 593-596 (1987); 
C.S. Harwood, N.N. Nichols, M.K. Kim, J.L. Ditty and R.E. Parales, 
/. Baaeriol. 176, 6479-6488 (1994); S. Romerosteiner, R.E. Parales, 
C.S. Harwood and J.E. Houghton, /. Baaeriol. 176, 5771-5779 (1994); 
J. Inoue, J.P. Shaw, M. Rekik and S. Harayama, J. Baaeriol. 177. 1196- 
1201 (1995)). 



A survey of potential microbial routes to aromatic aldehydes, including 
routes (i) from rra^-cinnamic acids, (ii) from benzoic acids by reduction 
and (iii) by conversion of aromatic amino acids to phenylpyruvic acids 
followed by treatment with base, has been presented by J. Casey and R. 
Dobb {Enzyme Microb. Technol. 14, 739-747 (1992)). 

US 5,128,253 describes a method of producing vanillin from ferulic acid 
by various microorganisms and extracts thereof or enzymes derived 
therefrom in the presence of a sulphydryl compound but does not disclose 
what any of the enzymes involved in the conversion of ferulic acid to 
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vanillin are. US 5,279,950 is a continuation-in-part application of US 
5,128 253 which additionally describes that Vanilla calluses can be used 
in the process. 

5 WO 94/13614 describes the production of vanillin from ferulic acid by the 
action of Vanilla root material and makes use of an adsorbent, such as 
charcoal, to extract vanillin but does not disclose the specific enzymes 
involved. 

10 EP 0 453 368 describes that a culture of Pycnoporus can convert 
frarts-ferulic acid into vanillin but does not disclose the specific enzymes 
involved. 

WO 94/02621 describes the production of vanillin from frons-ferulic acid 
15 by the action of a lipoxygenase enzyme. EP 0 405 197 describes the 
production of vanillin from eugenol/isoeugenol by bacteria from the 
genera Serratia, Klebsiella and Enterobacter by oxidation. 

Vanillin may also be produced from phenolic stilbenes as is mentioned in 
20 Hagedorn & Kaphammer (1994) Ann, Rev. Microbiol 48, 773-800. 

Vanillic acid is also a useful compound as it can be polymerised into 
oligomers or used as a monomer in the synthesis of polyesters; similarly 
p-hydroxybenzoic acid is also useful for polymer synthesis. 

25 

A first aspect of the invention provides a method of producing vanillin 
comprising the steps of 
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(1) providing trans-femlic acid or a salt thereof; and 

(2) providing fm™-ferulate:CoASH ligase activity (enzyme activity I), 
tmns-feruloyl SCoA hydratase activity (enzyme activity II), and 4- 
hydroxy-3-methoxyphenyI-^-hydroxy-propionyl SCoA (HMPHP 
SCoA) cleavage activity (enzyme activity III). 

The advantages of the present invention over chemical synthesis or 
extraction from the Vanilla pod include (i) economic advantage over 
extraction from Vanilla pod and freedom from geographical dependence 
on Vanilla growing areas; (ii) the ability to produce vanillin by a natural 
process, involving biological catalysts; (iii) the benefits of generating a 
natural flavour in situ in a fermented food or beverage, if the genes are 
expressed in appropriate food-grade hosts - eg lactic acid bacteria or 
yeasts; and (iv) the possibility of expanding the range of plants in which 
vanillin and related substances might be produced and from which they 
might be extracted. These and other examples of the methods of the 
invention are described in more detail below. 

We have determined the mechanism of chain-shortening of rr^-f eru late 
(m™-ferulic acid) by a strain of Pseudomonas fluorescent (named Ps. 
fluorescent biovar. V, strain AN103 and which we have abbreviated at 
some points to AN103) isolated from son. Our data indicate clearly that 
vanillin (4-hydroxy,3-methoxy benzaldehyde) is an intermediate and that 
the mechanism does not involve Oxidation. The vanillin pathway of ft. 
fluoresces biovar. V, strain AN103 is described in Figure 1. 
W-fcrulic acid (or a salt thereof) is interconverted with mm-feruloyl 
SCoA in the presence of CoASH; /r^-feruloyl SCoA is interconverted 
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with 4-hydroxy-3-methoxyphenyl-/3-hydroxypropionyl SCoA (HMPHP 
SCoA); and HMPHP SCoA is interconverted with vanillin. 

For convenience, mwu-ferulate:CoASH ligase activity is called enzyme 
5 activity I, rraw-feruloyl SCoA hydrolase activity is called enzyme activity 
II; and HMPHP SCoA cleavage activity is called enzyme activity III. The 
interconversions performed by these enzyme activities is shown in Figure 
1. 

10 The method of producing vanillin provided by the invention therefore 
includes the steps of exposing taww-ferulic acid or a salt thereof to enzyme 
activity I and forming a product, exposing the said product of enzyme 
activity I to enzyme activity II to form a product and exposing the said 
product of enzyme activity II to enzyme activity III to form a product. 

15 

I/my-ferulic acid or a salt thereof may be provided directly, for example 
by supplying pre-prepared /rans-ferulic acid or a salt thereof, or it may be 
provided indirectly, for example by supplying a precursor of taww-fenilic 
acid or a precursor of a salt of trans-ierulk acid and means to convert the 
20 said precursor into /ra/w-ferulic acid or a salt thereof. As is described in 
more detail below, it is convenient if the precursor is an ester of trans- 
ferulic acid and the means to convert said ester is a suitable esterase. 

By "providing fra/u-ferulate:CoASH ligase activity (enzyme activity I), 
25 rrcwtf-feruloyl SCoA hydratase activity (enzyme activity II), and 4- 
hydroxy-3-methoxyphenyl-/3-hydroxy-propionyl SCoA (HMPHP SCoA) 
cleavage activity (enzyme activity III)" we include the provision of the 
enzyme activities in any suitable form to effect the said production of 
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vanillin. For example, as is discussed in more detail below, the method 
of the invention specifically includes, but is not limited to, the provision 
of the enzyme activities (a) by intact or permeabilised Ps. fluoresces 
biovar. V, strain AN103 or a mutant thereof, (b) at least one of enzyme 
5 activities II or III of which is in a form substantially free of cellular 
material, (c) by intact or permeabilised cells in culture, particularly 
microorganisms, which have been genetically modified to contain genes 
which encode enzyme activities II or III (for example, food grade 
microorganisms such as lactic acid bacteria and brewing yeast), and (d) by 
o plants which have been genetically modified to contain genes which 
encode said enzyme activities. 

It is preferred if means for converting vanillin to a non-vanillii, product 
is absent or reduced. Of course, the enzyme activity HI is not such a 
means. Conveniently, these enzyme activities are provided by the soil 
bacterium Pseudomonas fluorescent biovar. V, strain AN103 the said 
bacterium being that deposited under the Budapest Treaty at the National 
Collection of Industrial and Marine Bacteria Limited, AURIS Business 
Centre, 23 St. Machar Drive, Aberdeen AB2 1RY, Scotland under 
Accession No NCIMB 40783, or a mutant or variant thereof. By "mutant 
or variant thereof we include any mutant or variant of the said bacterium 
provided that the bacterium retains the said enzyme activities whether or 
not at the same levels. It will be appreciated that the said enzyme 
activities can be retained even if the genes encoding said enzymes are 
mutated. For example, mutants which constimtively express (as opposed 
to conditionally or inducibly express) the said enzyme activities are 
particularly useful mutants of Ps. fluoresces biovar. V, strain AN103, as 
are variants in which one or more of the enzymes with the said activities 
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exhibit more favourable kinetic characteristics (for example, an increased 
turnover number or a decreased KJ. 

When Ps. fluoresce™ biovar. V, strain AN 103 is growing on 
5 franj-feralate it will derive maximum benefit if vanillin is catabolised 
further in order to provide more energy. However, in order to maximise 
the production of vanillin by Ps. fluorescens biovar. V, strain AN 103 it 
is preferred that the means for converting vanillin to a non-vanillin 
product is absent or reduced. We have found that in Ps. fluorescens 
io biovar. V, strain AN103 vanillin is converted to vanillic acid or a salt 
thereof by vaniUin:NAD + oxidoreductase. It is preferred if a mutant of 
Ps. fluorescens biovar. V, strain AN103 wherein the vanillin:NAD + 
oxidoreductase activity is absent or reduced is used in the method. Such 
a mutant can be made using a gene replacement strategy with a disrupted 
1 5 vanillin: NAD oxidoreductase gene, or a sequence of DNA from which this 
gene has been deleted. Gene replacement is well known in the art of 
bacterial genetics. Alternatively, isolation of such a mutant may be 
achieved by classical chemical mutagenesis, selecting on the basis of 
inability to grow on vanillin. 

20 

It will be appreciated that there are other means for converting vanillin to 
a non-vanillin product and it is preferred if these are absent or reduced in 
the method. 

25 Although Ps. fluorescens biovar. V, strain AN103 or mutants or variants 
thereof themselves are useful in the method of the invention as whole cells 
or permeabilised or immobilised cells, it is preferred if the enzyme 
activities I, II and III are provided by an intact-cell-free system of Ps. 
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fluoresces biovar, V, stain AN103 or a mutant or variant thereof. 
Suitable systems and extracts may be used by methods well known in the 
art, for example by French pressure cell or sonication followed by 
centrifugation. Alternatively, whole cells may be permeabilised using 
methods well known in the art, for example using detergents such as 
dimethyl sulphoxide (DMSO). 

Using such an intact-cell-free system allows the necessary substrates and 
any cofactors to reach readily the relevant enzymes and for the products 
to be released readily into the reaction medium if this is necessary for 
further reaction; however as discussed below, at least some of the 
enzymes of the invention may be involved in substrate (metabolic) 
channelling. 

We have found that none of the enzyme activities I, II and III from Ps. 
fluoresces biovar. V, strain AN103 is dependent on NAD* whereas 
enzyme activity IV from Ps. fluoresces (vanillin: NAD + oxido-reductase) 
requires NAD + . 

Thus, a preferred way of reducing means for converting vanillin to a 
non-vanillin product in an intact-cell-free system of Ps. fluoresces biovar. 
V, strain AN103 (or in a cell-permeabilised system of Ps. fluoresces 
biovar. V, strain AN103) is to omit NAD* from the reaction system. 
Any exogenous NAD* is readily and rapidly depleted by the presence of 
tawu-ferulate in the system. 

For the microorganisms of the present invention which can be used in the 
method of vanillin production, including Ps. fluoresces biovar. V, strain 
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AN 103, at least three main types of bioreactor may be used for the 
biotransformation reactions: the batch tank, the packed bed and the 
continuous-flow stirred tank; their applications and characteristics have 
been reviewed (MD. Lilly in "Recent Advances in Biotechnology", eds. 
5 R Vardar-Sukan and S.S. Sukan, Kluwer Academic Publishers, 
Dordrecht, 1992, pp 47-68 and toe. dr.). 

As is described in more detail below, enzyme activities II and III are 
available free from other enzyme activities, for example directly or 
10 indirectly from Ps. fluorescein biovar. V, strain AN103 and from other 
organisms or cells which have been genetically modified to express genes 
encoding the said enzyme activities. 

It will be appreciated that other microorganisms will be found which will 
15 be useful in the methods of the invention, for example, by screening. 
Such microorganisms and methods of screening and methods of use form 
part of the invention. The method of screening for other microorganisms 
possessing activities I, II and III is essentially that already described in the 
Materials and Methods section in the Examples for the isolation of AN 
20 103. The important aspect is isolation from an environment rich in trans- 
ferulate or related compounds (eg 4-fra/u-coumarate, fraray-caffeate [3,4- 
dihydroxy-rra/w-cinnamate] which, as described below, may also be 
substrates for enzyme activity I) and selection for growth on fnmy-ferulate 
(preferably) as sole carbon source. In practice, preferred sources are those 
25 in which plant-derived materials are being degraded; in addition to soil or 
compost, this would include the outflow or residues from factories or 
other installations processing such materials - eg sugar-beet factories, 
cocoa fermentation heaps etc - and the contents of the gastro-intestinal 
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tract, particularly of ruminants and other herbivores. It is possible that 
anaerobes might be found possessing these activities and due account can 
readily be taken of this in the isolation procedure. A priori, isolation of 
organisms with these activities might also be possible from marine 
environments. 

Genera in which further microorganisms useful in the invention will be 
found include Pseudomonas, Arthrobacter, Alcaligenes. Acinetobacter, 
Flavobacterium, Agrobacterium, Rhizobium, Streptomyces, 
Saccharomyces, Penicillium and Aspergillus. 

An alternative or additional approach is to use any one of the 
Pseudomonas genes encoding enzyme activities II or HI described herein 
or redundant sequences designed from the Pseudomonas enzyme amino- 
acid sequences in DNA probes or PGR amplification strategies to find 
related genes in other organisms. As is made more clear below, enzymes 
and nucleotide sequences which are functionally equivalent to those of 
isolated from AN103 but which differ in sequence form pan of the 
invention. 

Our studies indicate that the enzyme which interconverts trans-ferula and 
rr^-feruloyl SCoA in Ps. fluoresces biovar. V, strain AN103 uses 
Coenzyme A (CoASH), ATP and Mg- or other functionally equivalent 
cofactors. Thus, it is preferred that the method further comprises the step 
of (3) providing any one of the cofactors CoASH, ATP or Mg 2+ or other 
functionally equivalent cofactors. ATP is adenosine triphosphate. It is 
well known that other functionally equivalent cofactors can substitute in 
some cases for CoASH, ATP or Mg^. For example Mn- may be used 
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in place of Mg 2+ and derivatives or analogues of ATP, preferably with a 
hydrolysable 7-phosphate, may be used in place of ATP. 

We have also determined that, at least when the enzyme activity I is 
5 provided by the Pseudomonas AN103 enzyme which interconverts 
rr<ws-ferulate and taww-feruloyl SCoA and which enzyme uses ATP and 
Coenzyme ASH, it is convenient to include a system wherein either one, 
or both, of the cofactors Coenzyme ASH and ATP is recycled. The 
following ATP generation and CoASH recycling systems are preferred. 

10 

ATP generation: - 

(i) trans-Ferulate + CoASH + ATP + H z O -» Vanillin + Acetyl 
SCoA + AMP + PPi (overall reaction catalysed by Ps. fluorescens 

15 biovar. V, strain AN103 extract) 

(ii) AMP + ATP *± 2 ADP (adenylate kinase) 

(iii) Acetyl - P + ADP ^ Acetate + ATP (acetate kinase) 

(iv) Sum: trans-Ferulate + CoASH + 2 Acetyl ~ P + H 2 0 
Vanillin + Acetyl SCoA + 2 Acetate + PPi 

20 

CoASH recycling is achievable using commercially-available citrate 
synthase (EC 4.1.3.7) and citrate lyase (EC 4.1.3.6), viz:- 

(v) Acetyl SCoA + Oxaloacetate + H 2 0 Citrate + CoASH (citrate 
2s synthase) 

(vi) Citrate Oxaloacetate + Acetate (citrate lyase) 



Overall sum, (iv)-(vi): 
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(vii) rro^-Ferulate + 2 Acetyl ~ P + 2 H 2 0 -» Vanillin + 3 Acetate 
+ PPi 



The acetyl ~ P used in the overall process, (vii), would not itself be 
5 generated by enzymic means; however, none of its atoms would appear 
in the vanillin product. 

Acetyl phosphate is commercially available or can be synthesised using the 
method described by Stadtman (1957) Meth. Enzvmol. 3, 228-231. 



10 



15 



The reagents are commercially available from, for example Sigma 
Chemical Co, Fancy Road, Poole, Dorset, UK. Citrate lyase is typically 
from Enterobacter aerogenes; citrate synthase is typically from chicken 
heart, pigeon breast muscle or porcine heart. 

Thus, it is preferred if coenzyme ASH is recycled using the enzymes 
citrate synthase and citrate lyase; and it is preferred if the ATP is 
generated using the enzymes adenylate kinase (EC 2.7.4.3) and acetate 
kinase (EC 2.7.2.1). 

The co-factor recycling system is particularly preferred when using an 
intact-cell-free system. 



Wferulic acid or a salt thereof is readily available from plant material. 
25 Suitably, mwMerulic acid or a salt thereof is released from the plant 
material by the action of ferulic acid esterase. Thus, in a particularly 
preferred embodiment of the invention the rraw-ferulic acid or salt thereof 
is provided by the action of ferulic acid esterase on plant material. 



20 
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Trans-femlic acid and /ra/w-4-couraaric acid can together represent up to 
1.5% by weight of the cell walls of temperate grasses (R.D. Hartley and 
E.C. Jones, Phytochemistry 16, 1531-1534 (1977)). 7>a/is-ferulic acid is 
reported to comprise 0.5% (w/w) of wheat bran (M. C. Ralet, J.-F. 
5 Thibault and G. Delia Valle, J. Cereal Sci. 11, 249-259 (1990)), 3.1 % of 
maize bran (L. Saulnier, C. Marot, E. Chanliaud and J.-F. Thibault, 
Carbohydr. Polym. 26, 279-287 (1995)) and 0.8% of sugar beet pulp (V. 
Micard, G.M.G.C. Renard and J.-F. Thibault, Lebensm.-Wiss. u-Technol. 
27, 59-66 (1994)). These materials are amongst the preferred sources of 

10 w-a/ir-ferulic acid. Since r/wu-ferulic acid is present esterified with cell- 
wall polysaccharides, hydrolysis is essential. Alkaline or acid hydrolysis 
is possible, but enzymic hydrolysis is preferred. Typically, the initial step 
is the partial enzymic hydrolysis of carbohydrates (arabinans, xylans, 
rhamnogalacturanans) to which fra/u-ferulate is linked, followed by the 

15 release of rra/is-ferulate from the oligosaccharide fragments by trans- 
ferulic acid esterase activity. In practice, both steps may occur 
simultaneously in the reaction mixture. Descriptions of representative 
laboratory-scale processes are available in the literature (for example see 
L.P. Christov and B.A. Prior, Enzyme Microb. Technol. 15, 460-475 

20 (1993)); C.B. Faulds and G. Williamson, Appl. Microbiol. Biotechnol. 43, 
1082-1087 (1995); C.B. Faulds, P.A. Kroon, L. Saulnier, J.-F. Thibault 
and G. Williamson, Carbohydrate Polymers 27, 187-190 (1995)). 
Phenolic acid-releasing enzymes have been reported from a number of 
microorganisms, including Streptomyces olivochromogenes (C.B. Faulds 

25 and G. Williamson, J. Gen. Microbiol. 137, 2337-2345 (1991)), 
Penicillium pinophilum (A. Castanares, S.I. McCrae and T.M. Wood, 
Enzyme Microb. Technol. 14, 875-884(1992)), Neocallimastix spp. (W.S. 
Borneman, R.D. Hartley, W.H. Morrison, D.E. Akin and L.G. 
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Ljungdahl, Appl. MicrobioL Bioiechnol. 33, 345-351 (1990)), 
Schizophyllum commune (R.C. MacKenzie and D. Bilous, Appl. Envir. 
MicrobioL 54, 1170-1173 (1988)) and Aspergillus spp. (M. Tenkanen, J. 
Schuseil, J. Puis and K. Poutanen, /. BiotechnoL 18, 69-84 (1991); C.B. 
Faulds and G. Williamson, Microbiology 140, 779-787 (1994)). A trans- 
it acid esterase (XYLD) has been characterised from Pseudomonas 

fluorescent subsp. cellulosa, together with an arabinofuranosidase(XYLC) 
and an endoxylanase (XYLB; L.M.A. Ferreira, T.M. Wood, G. 

Williamson, C.B. Faulds, G.P. Hazlewood and H.J. Gilbert, Biochem. J. 

294, 349-355 (1993)). The genes for all three enzymes have been isolated 

(G.P. Hazlewood and H.J. Gilbert, in "Xylans and Xylanases", eds. J. 

Visser, G. Beldman, M.A. Kusters-van Someren and A.G.J. Voragen, 

Elsevier, Amsterdam, pp 259-273 (1992)). All of these references are 

incorporated herein by reference. 

Thus, advantageously the trans-kmWc acid or a salt thereof may be 
provided by the action of /ra/w-ferulic acid esterase on said ester. More 
particularly, it is advantageous to introduce a gene encoding said esterase 
into a host cell or organism which is being used in the methods of the 
invention. Thus, it is convenient to introduce a /ra/w-ferulic acid esterase 
gene, such as the aforementioned XYLD gene, into a plant which is being 
used in the methods of the invention. 

Although, as described above, the method may be performed using 
enzyme activities I, II and III which are provided by Ps. fluoresces 
biovar. V, strain AN103 or mutants or variants thereof themselves, or 
intact-cell-free extracts thereof, it is preferred if at least one of the enzyme 
activities II and HI is provided by a substantially purified enzyme. 
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Substantially purified enzymes with enzyme activities II and III are 
described below. 

In a particularly preferred embodiment of the invention the method of the 
5 first aspect of the invention further comprises providing a compound, in 
addition to /ra/w-feruiic acid or a salt thereof, which may be converted by 
any one of enzyme activities I, II or III into a desirable product. Suitably 
said compound is converted by any one or more of said enzyme activities 
into a product which is found in, and preferably contributes to the taste or 
10 aroma of, vanilla as extracted from Vanilla pod. 

Vanilla as extracted from Vanilla pod contains vanillin as a major 
component but also smaller quantities of desirable components such as p- 
hydroxybenzoic acid, p-hydroxybenzaldehyde and vanillic acid. Typically 
15 these components, and vanillin, are present as glucosides in green vanilla 
pods as well as in the free form. However, upon hydrolysis or 
fermentation of the green pods or of hydrolysis of the fermented pods, 
most of the components are present in the free form. 

20 Thus, it is particularly preferred if said compound is any one of trans-4- 
coumaric acid or a salt thereof, rra/ur-4-coumaroyl SCoA, /roay-caffeic 
acid or a salt thereof, tains-caffeoyl SCoA, or 3,4-methylenedioxy-//my- 
cinnamic acid or a salt thereof. By the action of one or more of enzyme 
activities I, II or III rrafw-4-coumaric acid or a salt thereof and trans A- 

25 coumaroyl SCoA are converted to p-hydroxybenzaldehyde; /ra/w-caffeic 
acid or a salt thereof and fro/zs-caffeoyl SCoA are converted to 3,4- 
dihydroxybenzaldehyde; and 3,4-methylenedioxy-rran5-cinnamic acid or 
a salt thereof is converted to heliotropin. 
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It is preferred if the compound is ^-4-coumaric acid or a salt thereof 
or rra^-^umaroyl SCoA and that the desirable product is 4- 
hydroxybenzaldehyde which is a significant component of natural Vanilla 
extract. 



The enzyme activities I, II and III from ft. fluorescent biovar V, strain 
AN103 are able to use ^-caffeate and /ra^-coumarate, (and, as 
appropriate, the products of their reaction with enzyme activity I) with 
reasonable efficiency whereas cinnamate and 3,4-methyienedioxy-^- 
cinnamate, although may be used as substrates, are poor substrates of the 
AN103 enzymes. 

Thus, the method of the first aspect of the invention is suited to make 
vanilla flavourings and aromas which more closely resemble the vanilla 
from Vanilla pod. 

The method of the first aspect of the invention may, in certain 
circumstances, also be performed using the host cells and genetically 
modified cells and organisms as described below in more detail. 

A second aspect of the invention provides a method of producing vanillic 
acid, or a salt thereof, comprising the steps of 

(1) providing rra/u-ferulic acid or a salt thereof; 

(2) Providmgrro^-femlate^ 

hydratase activity, and ^hydroxy-S-methoxyphenyl./S-hydroxy- 
propionyl SCoA (HMPHP SCoA) cleavage activity; and 

(3) providing an activity that interconverts vanillin and vanillic acid. 
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For convenience, the activity that interconverts vanillin and vanillic acid 
is called enzyme activity IV. Conveniently the activity is provided by 
vanillin:NAD+ oxidoreductase (vanillin dehydrogenase). Suitably, this 
activity is provided by Ps. fluoresces biovar. V, strain AN103. Methods 

5 of converting vanillin to vanillic acid or a salt thereof are also known in 
the art, for example Perestelo et al (1989) App. Environ. Microbiol. 55, 
1660-1662 describes the production of vanillic acid from vanillin by 
resting cells of Serratia marcescens and Pomelto & Crawford (1983) App. 
Environ. Microbiol. 45, 1582-1585 describe whole-cell bioconversion of 

10 vanillin to vanillic acid by Streptomyces viridosporus. 

The method of producing vanillic acid provided by the invention therefore 
includes the steps of exposing rrow-ferulic acid or a salt thereof to enzyme 
activity I and forming a product, exposing the said product of enzyme 
activity I to enzyme activity II to form a product, exposing the said 
product of enzyme activity II to enzyme activity III to form a product, and 
exposing the said product of enzyme activity III to enzyme activity IV to 
form a product. 

It will be appreciated that vanillic acid can be made by the same means as 
vanillin is made in the method of the first aspect of the invention 
provided, of course, that enzyme activity IV is supplied. 

A further preferred embodiment of the first aspect of the invention 
comprises the further step of separating vanillin from the other reaction 
components. 
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Vanillin, and other aromatic aldehydes, are, for example, recoverable by 
extraction with solvent, including supercritical carbon dioxide, and by 
organophilic pervaporation, using membranes constructed of hydrophobic 
polymers (G. Bengston and K.W. Bodekker, in "Bioflavour 95\ eds. P. 
Etievant and P. Schreier, INRA, Paris, pp 393-403 (1995); S.M. Zhang 
and E. Drioli, Separ. Sci. Technol. 8, 1-31 (1994)); pervaporation 
technology has been applied, for example, to the recovery of flavour 
compounds of wine (N. Rajagopalan and M. Cheryan, J. Membrane Sci. 
104, 243-250 (1995)). Solid-phase extraction, followed by desorption 
with solvent, is also possible, though less preferred. 

However, in some circumstances, particularly where minor reaction 
products are present which are similar to compounds present in the vanilla 
isolated from Vanilla pod, vanillin is not isolated. 

A further preferred embodiment of the second aspect of the invention 
comprises the further step of separating vanillic acid or a salt thereof from 
the other reaction components. 

Vanillic acid and other carboxylic acids may, for example, be recovered 
by solid-phase extraction, by solvent extraction under acidic conditions, 
or by pertraction; for example, L. Boyadzhiev and I. Atanassova (Process 
Biochemistry 29, 237-243 (1994)) describe the recovery of the aromatic 
amino acid, phenylalanine, by pertraction. 

A third aspect of the invention provides Pseudomonas fluoresces biovar. 
V, strain AN103 as deposited under the Budapest Treaty at the National 
Collections of Industrial and Marine Bacteria Limited, AURIS Business 
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Centre, 23 St Machar Drive, Aberdeen AB2 1RY, Scotland under 
Accession No NCIMB 40783, or a mutant or variant thereof. Preferred 
mutants and variants are the same as those preferred in the first aspect of 
the invention. A particularly preferred mutant of Ps. fluorescent biovar. 
5 V, strain AN103 is one which accumulates vanillin when provided with 
trans- femlic acid or a salt thereof. Conveniently, this is a mutant of Ps. 
fluorescent biovar. V, strain AN103 wherein vanilIin:NAD + 
oxidoreductase activity is absent or reduced. Suitably, there is a mutation 
in the gene encoding vanillin:NAD + oxidoreductase such that the enzyme 
10 activity is absent or substantially reduced. Such a mutant can be made 
as described above. 

A fourth aspect of the invention provides a polypeptide which, in the 
presence of appropriate cofactors if any, is capable of catalysing the 

15 interconversion of /ra/u-feruloyl SCoA and 4-hydroxy-3-methoxy- 
phenyl-j3-hydroxypropionyl SCoA (HMPHP SCoA). Such a polypeptide 
has enzyme activity II. Conveniendy, the polypeptide comprises 
rra/u-feruloyl SCoA hydratase; more conveniently the polypeptide 
comprises taws-feruloyl SCoA hydratase from Ps. fluorescens biovar. V, 

20 strain AN103 or fragments or variants thereof which have at least 1 % of 
the specific activity of the native enzyme (in relation to rra/w-feruloyl 
SCoA hydratase activity) , preferably at least 10%, more preferably at least 
100%. 

25 The enzyme activity is readily purified as described in the Examples. 
Modifications to this procedure may be readily made by the person skilled 
in the art so that a polypeptide with enzyme activity II can be obtained 
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from any suitable source" making use of the enzyme activity II assay 
procedure described in the Examples. 

It is preferred if the polypeptide of the fourth aspect of the invention 
comprises the amino acid sequence. 



MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 

PheVallleLeuAsnArgProGluLysArgAsxxAlaMetSerProThrLeuAsnArgGlu 

MetlleAspValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValI.euThr 

GlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheArgGluValAsp 

Al a GlyP r oGluIleLeuGlnGluLysIleArgArgGluAla S erGlnTrpGlnTrpI.ys 

Le^euArgMetTyrAlaLysProThrlleAlaHetValAsnGlyTrpCysPheGlyGly 

GlyPheSerPro L euValAlaCysA S pLeuAlaIleCysAlaAspGluAlaThrPheGly 

LeuSerGluIleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAsp 

ThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrPheGlyGlyGln 

LysAiaAlaGluMetGlyLeuValAsnGluSerValProI.euAlaGlnLeuArgGluVal 

ThrHeGluLeuAlaArgAsnLeuLeuGluLysAsnProValValLeuArgAlaAlaLys 

HisGlyPheLysArgCysArgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAla 

LysLeuAspGlnSerArgI.euLeuAspThrGluGlyGlyArg e iuGlnGlyMetLy S Gl„ 
PheLeuAspAspLysSerlleLysProGlyi.euGlnAlaXVrLysArg (SEQ ID No 2) , 



or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
nucleotides 2872 to 3699. 

A fifth aspect of the invention provides a polypeptide which, in the 
presence of appropriate cofactors if any, is capable of catalysing the 



WO 97/35999 PCT/GB97/0G8O9 

22 

interconversion of 4-hydroxy-3-methoxyphenyl-j3-hydroxy-propionyl 
SCoA (HMPHP SCoA) and vanillin. Such a polypeptide has enzyme 
activity III. Conveniently, the polypeptide comprises HMPHP SCoA 
cleavage enzyme; more conveniendy the polypeptide comprises HMPHP 
5 SCoA cleavage enzyme from Ps. fluorescens biovar. V, strain AN103 or 
fragments or variants thereof which have at least 1% of the specific 
activity of the native enzyme (in relation to HMPHP SCoA cleavage 
activity), preferably at least 10%, more preferably at least 100%. 



io The enzyme activity is readily purified as described in the Examples. 
Modifications to this procedure may be readily made by the person skilled 
in the art so that a polypeptide with enzyme activity III can be obtained 
from any suitable source making use of the enzyme activity III assay 
procedure described in the Examples. 

15 

It is preferred if the polypeptide of the fifth aspect of the invention 
comprises the amino acid sequence 



MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 
20 PheVallleLeuAsnArgProGluLysArgAsnAlaMetSerProThrLeuAsnArgGlu 
MetlleAspValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuThr 
GlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheArgGluValAsp 
AlaGlyProGluIleLeuGlnGluLysIleArgArgGLuAlaSerGlnTrpGlnTrpLys 
LeuLeuArgMetTyrAlaLysProThrlleAlaMetValAsnGlyTrpCysPheGlyGly 
25 GlyPheSerProLeuValAlaCysAspLeuAlalleCysAlaAspGluAlaThrPheGly 
LeuSerGLuIleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAsp 
ThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrPheGlyGlyGln 
LysAlaAlaGluMetGlyLeuValAsnGluSerValProLeuAlaGlnLeuArgGluVal 
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ThrlleGluLeuAlaArgAsnLeuLeuGluLysAs 



HisGlyPheLysArgCysArgGluLeuThrTrpGl 



nProValValLeuArgAlaAlaLys 

uGlnAsnGluAspTyrLeuTyrAla 
L ysLeuA S p G l nSerArgL euI.eu AS p ThrGluGlyGlyArgGluGlnGlyMetLysGin 

PheLeuAspAspLysSerlleLysProGlyLeuc^^^^ (SEQ m ^ ^ 

or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
nucleotides 2872 to 3699. 



A sixth aspect of the invention provides a polypeptide comprising the 

amino acid sequence 



Me.I.eu^pValProLeuLeuIleGlyGlyGlnSerCysProAlaArgAspGlyArgThr 

PheGluArgArgAsnProValThrGlyGluLeuValSerArgValAlaAlaAlaThr^eu 

GluAspAlaAspAlaAlaValAlaAlaAlaGlnGlnAlaPheP.oAlaTrpAlaAiaLeu 
AlaProAsnGluAra^Ser*^^ 

GlyGluPhelleGluAlaAlaGly^^^ 

ValArgLeuAlaAlaAsnMetLeuArgGluAlaAlaSerMetThrThrGlnVaiAsnGiy 

GluVallleProSerAspValProGlySerPheAlaMetAlal.euArgGlnProCysGly 
ValValLeuGlyIleA 1 aProTrpAsnAlaProVal I le^ uAlaThrArgAlalleAla 

MetProI.euAlaCysGlyAsnrhrValVall.uLysMase^,^^^^^ 

HisArgLeuIleGlyGlnValLeuGlnAspAlaGlyLeuGlyAspGlyValValAsnVal 
IleSerAsnAlaProAlaAspAlaAlaGlnlleVaiGluArgLeulleAlaAsnProAla 
ValArgArgValAsnPheThrGlySerThrHisValGiyArglleValGlyGWuse, 
AlaArgHisLeuLysProAlaLeuLeuGluLeuGlyGlyLysAiap^ 



5uLe U ValLeu 
laTyrPheAsnGln 

GlyGlnlleCysMetSerThrGluArgLeuIleValAspAl, 



AspAspAlaAspLeuGluAlaAlaValGlnAlaAlaAlaPheGlyAlc 



LaLysValAlaAspAlaPhe 
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ValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspProAlaAspProGlu 

SerValLeuGlySerLeuValAspAlaSerAlaGlyThrArglleLysAlaLeuIleAsp 

AspAlaValAlaLysGlyAlaArgLeuVallleGlyGlyGlnLeuGluGlySerlleLeu 

GlnProThrLeuLeuAspGlyValAspAlaSerMetArgLeuTyrArgGluGluSerPhe 

5 GlyProValAlaValValLeuArgGlyGluGlyGluGluAlaLeuLeuGlnLeuAlaAsn 

AspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGlyArgAlaLeuAla 

LeuAlaGlnArgValGluSerGlylleCysHisIleAsnGlyProThrValHisAspGlu 

AlaGlnMetProPheGlyGlyValLysSerSerGlyTyrGlySerPheGlyGlyLysAla 

SerlleGluHisPheThrGlnLeuArgTrpValThrLeuGlnAsnGlyProArgHisTyr 
10 Prolle 

(SEQ ID No 4) or a fragment or variant thereof. 

The amino acid sequence is that given in Figure 12 as that encoded by 
15 nucleotides 3804 to 5249. 

As described in detail in the examples, this polypeptide sequence encodes 
an enzyme with vanillin: NAD + oxidoreductase activity from Ps. 
fluorescens biovar V., strain AN103. 

20 

By "variants" we include deletions, insertions and substitutions either 
conservative or non-conservative, where such changes may reduce or 
enhance the activity, or may not substantially alter the activity. In 
particular, the seventh aspect of the invention includes the complete 
25 polypeptide sequence of Ps, fluorescens biovar V, strain AN 103 
vanillin: NAD + oxidoreductase and this polypeptide itself. 
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By "conservative substitutions" is intended combinations such as Gly, Ala; 
Val, He, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 
Such variants may be made using the methods of protein engineering and 
site-directed mutagenesis as described below and as is well known in the 
art. 



A preferred embodiment of the invention is a polypeptide as defined in the 
fourth, fifth or sixth of the invention which is substantially pure. 

By "substantially pure" we mean that the polypeptide is substantially free 
of other polypeptides, or other macromolecules, with which it is usually 
found in nature. Suitably, the polypeptide is substantially free of any 
other polypeptides or macromolecules. It is preferred if the polypeptide 
has less than 50% by weight of any other polypeptide, preferably less than 
10%, more preferably less than 1 %, still more preferably less than 0. 1 % 
and most preferably less than 0.01 %. 

Polypeptides can be purified using methods known in the an. It is 
preferred if the polypeptide is the product of a recombinant DNA. 

A single polypeptide chain may comprise more than one of the enzyme 
activities (ID iftmr-fciuloyl SCoA hydratase activity or (III) HMPHP 
SCoA cleavage activity. 

Our data in the Examples shows that, in the case of ft. fluoresces 
biovar. V, strain AN103, enzyme activities II and III are found in the 
same polypeptide chain, the sequence of which is given as the preferred 
polypeptides of the fifth and sixth aspects of the invention. Thus, when 
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enzyme activities II and III are provided in any aspect of the invention it 
is most convenient if they are provided in the same polypeptide chain. 

It will be appreciated that, using protein engineering methods or chemical 
cross-linking it may be possible to produce a single molecule which has 
enzyme activities II and in. Such a molecule, therefore, forms a further 
aspect of the invention. 

An seventh aspect of the invention provides a polynucleotide encoding a 
polypeptide as defined in any one of the fourth, fifth or sixth aspects of 
the invention. 

By "polynucleotide" we include RNA and DNA. DNA is preferred. 

Thus, this aspect of the invention provides a polynucleotide which encodes 
any one of a polypeptide which has (II) /ra/tr-feruloyl SCoA hydratase 
activity; (III) HMPHP SCoA cleavage activity; (IV) vani!lin:NAD + 
oxidoreductase activity; or a polypeptide which has more than one of these 
activities. Preferably the polynucleotide is derived from Ps. fluoresce™ 
biovar. V, strain AN 103. A preferred polynucleotide comprises all or ai 
least a part of the Ps. fluoresce™ DNA contained within the cosmid clone 
pFI793 as deposited under the Budapest Treaty at the National Collections 
of Industrial and Marine Bacteria Limited, AURIS Business Centre, 23 St 
Machar Drive, Aberdeen AB2 1RY, Scodand under Accession No 
NCIMB 40777, or a fragment or variant thereof. 

The isolation of the cosmid clone pFI 793 is described in Example 5; pFI 
793 includes DNA which encodes polypeptides which have enzyme 
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activities n. ID and IV.' The cosmid clone pFI 793 itself, the genes 
contained in the Ps. fluorescent DNA thereof, and variants thereof form 
separate aspects of the invention. 

> A variant of a polynucleotide Eludes any insertion, deletion or 
substation of the seqnence whi ch encodes a fragment or variant of a 
polypeptide as defined above. 

For example, site-directed mutagenesis or other techniques can be 
» employed to create single or multiple mutations, such as replacements, 
insertions, deletions, and transpositions, as described in Botstein and 
Shortle, Strategies and Applications of A Vitro Mutagenesis, Science, 229: 
193-210 (1985), which is incorporated herein by reference. Since such 
modified polynucleotides can be obtained by the application of known 
techniques to the teachings contained herein, such modified 
polynucleotides are within the scope of the claimed invention. 

Moreover, it will be recognised by those skilled in the an that the 
polynucleotide sequence (or fragments thereof) of the invention can be 
used to obtain other DNA sequences that hybridise with it under 
conditions of high stringency. Such DNA includes any genomic DNA. 

Accordingly, the polynucleotide of the invention includes DNA that shows 
at least 55 per cent, preferably 60 per cent, and most preferably 70 per 
cent homology with the polynucleotide sequences identified in the 
invention, provided that such homologous DNA encodes a protein which 
is usable in the methods described herein. 
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DNA-DNA, DNA-RNA and RNA-RNA hybridisation may be performed 
in aqueous solution containing between 0.1XSSC and 6XSSC and at 
temperatures of between 55°C and 70°C. It is well known in the art that 
the higher the temperature or the lower the SSC concentration the more 
5 stringent the hybridisation conditions. By high stringency we mean 
2XSSC and 65 °C. 1XSSC is 0.15M NaCl/0.015M sodium citrate. 

"Variants" of the polynucleotide include polynucleotides in which 
relatively short stretches (for example 20 to 50 nucleotides) have a high 
10 degree of homology (at least 50% and preferably at least 90 or 95 %) with 
equivalent stretches of the polynucleotide of the invention even though the 
overall homology between the two polynucleotides may. be much less. 
This is because important active or binding sites may be shared even when 
the general architecture of the protein is different. 

15 

A particularly preferred polynucleotide comprises the nucleotide sequence 

ATGAGCACATACGAAGGTCGCTGGAAAACGGTCAAGGTCGAAATCGAAGACGGCATCGCG 
TTTGTCATCCTCAATCGCCCGGAAAAACGCAACGCGATGAGCCCGACCCTG7VACCGCGAG 

20 ATGATCGATGTTCTGGAAACCCTCGAGCAGGACCCTGCCGCCGGTGTGCTGGTGCTGACC 
GGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCCGCGAAGTGGAC 
GCCGGCCCGGAAATCCTCCAGGAAAAAATCCGCCGCGAAGCCTCGCAATGGCAATGGAAA 
CTGCTGCGCATGTACGCCAAGCCGACCATCGCCATGGTC71ATGGCTGGTGCTTCGGCGGC 
GGTTTCAGCCCGCTGGTGGCCTGCGACCTGGCGATCTGCGCCGACGAAGCAACCTTCGGT 

25 CTCTCGGAAATCAACTGGGGTATCCCGCCGGGCAACCTGGTGAGCAAGGCCATGGCCGAC 
ACCGTGGGCCACCGCCAGTCGCTCTACTACATCATGACCGGCAAGACCTTCGGTGGGCAG 
AAAGCCGCCGAGATGGGCCTGGTCAACGAAAGCGTGCCCCTGGCGCAACTGCGCGAAGTC 
ACCATCGAGCTGGCGCGTAACCTGCTCGAAAAAAACCCGGTGGTGCTGCGTGCCGCCAAA 
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CACGGTTTCAAACGCTGCCGCGAACTGACCTGGGAGCAGAACGAGGATTACCTGTACGCC 

AAGCTCGATCAGTCGCGTTTGCTGGACACCGAAGGCGGTCGCGAGCAGGGCATGAAGCAATTCC 
TCGACGACAAGAGCATCAAGCCTGGCCTGCAAGCGTATAAACGC (SEQ ID No 1, 

(as given in Figure 12, nucleotides 2872 to 5249) or a fragment or variant 

thereof. 

This polynucleotide encodes HMPHP SCoA cleavage enzyme activity and 
a /ra^-feruloyl SCoA hydratase activity from Ps. fluoresces biovar. V, 
strain AN103 and encodes the preferred polypeptide of the fifth and sixth 
aspects of the invention. 

A further particularly preferred polynucleotide comprises the nucleotide 
sequence 



ATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGTCGTGCCCCGCGCGCGACGGTCGAACC 
TTCGAGCGCCGCAACCCGGTGACTGGCGAGTTGGTGTC 

GAAGATGCC^CGCCGCCC^CCGCTGCCCAGCAAGCGTTTCCCGC^GCCGCGCTG 
GCGCCCAATGAAC(^CAGCCGTTTGCTCAAGGCCGCC<^ 

GGCGAGTTC^TCGAGGCGGCGGGCGAGACCGGCGCCATGGCCAACTGGTACGGGTTCAAC 
GTACGGCTGGCGGCCAACATGCTGCGTGAA.GCGGCATCGATGAC(^CCCAGGTCAATGGT 
GAAGTGATTCCCTCGGACGTTCCCGGCAGTTTCGCCATGGCCCTGCGCCAGCCCTGTGGC 
GTGGTGCTGGGCATCGCCCCCTGGAACGCCCCGGTGATTCTCGCCACCCGGGCGATTGCC 
ATGCCGCTGGCCTGTGGCAACACCGTGGTGCTGAAGGCTTCCGAGCTGAGTCCG^ 

CATCGCTrGATCGGCCAGGTGCTGCAGGACGCCGGCCTGGGCGATGGCGTGGTCAACGTC 
ATCAGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGATTGCCAACCCGGCC 
GTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCGGGCGCATTGTCGGCGAGCTCTCG 
GCGCGCCACCTCAAACCGGCGTIX3CTCGAGCTGGGCGGCAAGGCACCGTTGCTGGTGCTC 
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GACGATGCCGACCTGGAGGCTGCCGTCCAGGCGGCGGCGTTTGGCGCCTACT 
GGACAGATCTGTATGTC(^CCGAGCGCCTGATTGTCGATGCCAAGOT 
GTCGCCCAGTTGGCGGCCAAGGTCGAGACCCTGCGCGCCGGTGATCCTGCCGACCCGGAG 
TCGGTGCTCGGTTCGCTGGTGGACGCCAGCGCn3GCACGCGGATCAAAGC 
5 GATGCCGTGGCCAAGGGCGCGCGCCTX^TAATCGGCGGGCAACTGGAGGGCA^ 
CAGCCGACCCH^CTCGACGGTGTCGACGCGAGCATGCGT^ 

GGCCXTGGTGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCTGTTGCAACTGG 
GACTCCGAGTTCGGTTTGTCGGCGGCGATTTTCAGTCGTGACACCGGCCGTGCCCT 
CTGGCCCAGCGGGTCGAATCGGGCATCTGCCACATCAACGGCCCGACCGTGCACGACGAA 
10 GCGCAAATGCCTTTTGGCGGGGTCAAGTCCAGC^ 
TCGATTGAGCATTTCACTCAGTTGCGCTGGCT 
CCGATC 

(SEQ ID No 3) (as given in Figure 12, nucleotides 3804 to 5249) 

15 

or a fragment or variant thereof. This polynucleotide encodes the 
polypeptide sequence of the sixth aspect of the invention. It is particularly 
convenient to isolate the whole gene from cosmid clone pFI 793 as 
deposited under the Budapest Treaty at NCIMB under Accession No 
20 NCIMB 40777. 

The polynucleotides of the invention are all readily isolated from pFI 793 
by probing with the given sequences or parts thereof, or by other methods 
known in the art, and the nucleotide sequences can be confirmed by 
25 reference to the deposited cosmid (pFI 793). 



It will be appreciated that fragments and variants of the polynucleotides 
of the invention can readily be made by the person skilled in the art using 
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standard molecular biological methods such as those described in 
Sambrook et al "Molecular Cloning, a laboratory manual", (1989), (2nd 
Edition), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York. The whole gene and variants and fragments thereof are specifically 
included in this aspect of the invention. 

It is preferred if the polynucleotide, conveniently DNA, is joined to a 
nucleic acid vector. 



DNA constructs of the invention may be purified from the host cell using 
well known methods. 



For example, plasmid vector DNA can be prepared on a large scale from 
cleaved lysates by banding in a CsCl gradient according to the methods of 
Ciewell & Helinski (1970) Biochemistry 9, 4428-4440 and Clewell (1972) 
J. Bacterial. 110, 667-676. Plasmid DNA extracted in this way can be 
freed from CsCl by dialysis against sterile, pyrogen-free buffer through 
Visking tubing or by size-exclusion chromatography. 

Alternatively, plasmid DNA may be purified from cleared lysates using 
ion-exchange chromatography, for example those supplied by Qiagen 
(Chatsworth, CA, USA). Hydroxyapatite column chromatography may 

also be used. 

The DNA is then expressed in a suitable host to produce a polypeptide of 
the invention. Thus, a DNA encoding a polypeptide of the invention may 
be used in accordance with known techniques, appropriately modified in 
view of the teachings contained herein, to construct an expression vector, 
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which is then used to transform an appropriate host cell for the expression 
and production of the polypeptide of the invention. Such techniques 
include those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 
to Rutter et al, 4, 530,901 issued 23 July 1985 to Weissman, 4,582,800 
5 issued 15 April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark 
et al, 4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 
November 1987 to Itakura et al, 4,710,463 issued I December 1987 to 
Murray, 4,757,006 issued 12 July 1988 to Toole, Jr. et al, 4,766,075 
issued 23 August 1988 to Goeddel et al, and 4,810,648 issued 7 March 
10 1989 to Stalker, ail of which are incorporated herein by reference. 

The DNA encoding the polypeptide constituting the compound of the 
invention may be joined to a wide variety of other DNA sequences for 
introduction into an appropriate host. The companion DNA will depend 
15 upon the nature of the host, the manner of the introduction of the DNA 
into the host, and whether episomal maintenance or integration is desired. 

Generally, the DNA is inserted into an expression vector, such as a 
plasmid, in proper orientation and correct reading frame for expression. 

20 If necessary, the DNA may be linked to the appropriate transcriptional and 
translational regulatory control nucleotide sequences recognised by the 
desired host, although such controls are generally available in the 
expression vector. The vector is then introduced >to the host through 
standard techniques. Generally, not all of the hos^ will be transformed 

25 by the vector. Therefore, it will be necessary to select for transformed 
host cells. One selection technique involves incorporating into the 
expression vector a DNA sequence, with any necessary control elements, 
that codes for a selectable trait in the transformed cell, such as antibiotic 
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resistance. Alternatively; the gene for such selectable trait can be on 
another vector, which is used to co-transform the desired host cell. 

Host cells that have been transformed by the recombinant DNA of the 
invention may then be cultured for a sufficient time and under appropriate 
conditions known to those skilled in the an in view of the teachings 
disclosed herein to permit the expression of the polypeptide, which, if 
desirable, can then be recovered. 

Many expression systems are known, including bacteria (for example 
Escherichia coli and Bacillus subtilis) , yeasts (for example Saccharomyces 
cerevisiae), filamentous fungi (for example Aspergillus), plant cells and 
whole plants, animal cells and insect cells. 

The vectors include a prokaryotic replicon, such as the CoIEl ori, for 
propagation in a prokaryote, even if the vector is to be used for expression 
in other, non-prokaryotic, cell types. The vectors can also include an 
appropriate promoter such as a prokaryotic promoter capable of directing 
the expression (transcription and translation) of the genes in a bacterial 
host cell, such as E. coli, transformed therewith. 

Several promoters are available to direct transcription of bacterial and 
other heterologous genes in plants. In particular, these include the 35S 
promoter of cauliflower mosaic virus (CaMV 35S), the ribulose 
bisphosphate carboxylase small subunit promoter and the Agrobacterium 
T-DNA octopine synthase and manopine synthase promoters. These 
promoters have been widely used, for example, in conjunction with 
bacterial genes conferring herbicide resistance (see D.M. Stalker, ibid.. 
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pp 82-104). These promoters do not confer any specificity of gene 
expression at the organ, tissue or organellar levels, or responsiveness of 
gene expression to environmental influences such as light. 

5 A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are 
typically provided in plasmid vectors containing convenient restriction sites 
for insertion of a DNA segment of the present invention. 

10 

Typical prokaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories, (Richmond, CA, USA) and 
p7rc99A and pKK223-3 available from Pharmacia, Piscataway, NJ, USA. 

15 A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, NJ, USA. This vector uses the SV40 late 
promoter to drive expression of cloned genes, the highest level of 
expression being found in T antigen-producing cells, such as COS-1 cells. 

20 An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour virus long tenninal repeat to 
drive expression of the cloned gene. 

25 Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating plasmids (Yips) and incorporate the yeast selectable markers 
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HIS3, TRP1, LEU2 and URA3. Plasmids pRS413-416 are Yeast 
Centromere plasmids (YCps) 

A variety of methods have been developed to operably link DNA to 
vectors via complementary cohesive termini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
polymerase I, enzymes that remove protruding, 3 '-single-stranded termini 
with their 3'-5'-exonucleolytic activities, and fill in recessed 3'-ends with 
their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the reaction are 
DNA segments carrying polymeric linker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 
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Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

5 A desirable way to modify the DNA encoding the polypeptide of the 
invention is to use the polymerase chain reaction as disclosed by Saiki ei 
al (1988) Science 239, 487-491. 



In this method the DNA to be enzymatically amplified is flanked by two 
10 specific oligonucleotide primers which themselves become incorporated 
into the amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into 
expression vectors using methods known in the art. 

15 The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
be either prokaryotic or eukaryotic and it may be comprised in a 
multicellular organism such as a plant. Bacterial cells are preferred 
prokaryotic host cells and typically are -a strain of E. coli such as, for 

20 example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 
American Type Culture Collection (ATCC) of Rockville, MD, USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and plant 
cells. Yeast host cells include YPH499, YPH500 and YPH501 which are 

25 generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Preferred plant host cells and plants include those from Nicotiana 
spp., Solatium tuberosum (potato), Brassica spp. (eg oil seed rape), Beta 
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SPP- (eg sugar beet, leaf "beet and beetroot). Capsicum spp. and Vanilla 
spp. 



Transformation of appropriate cell hosts with a DNA construct of the 
i present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
prokaryotic host cells, see, for example, Cohen et al (1972) Proc. Natl 
Acad. Sci. USA 69, 2110 and Sambrook etal (1989) Molecular Cloning, 
A laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY. Transformation of yeast cells is described in Sherman et al 
(1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also 
useful. With regard to plant cells and whole plants three plant 
transformation approaches are typically used (J. Draper and R. Scott in D 
Grierson (ed.), "Plant Genetic Engineering", Blackie, Glasgow and 
London, 1991, vol. 1, pp 38-81): 

0 Agrobacterium-medmed transformation, using the Ti plasmid of A 
tumefaciens and-the Ri plasmid of A. rhizogenes (P. Armitage, R. Walden 
and J. Draper in J. Draper, R. Scott, P. Armitage and R. Walden (eds.), 
-Plant Genetic Transformation and Expression - A Laboratory Manual", 
Blackwell Scientific Publications, Oxford, 1988, pp 1-67; R.J. Draper, 
R.Scott and J. Hamill ibid., pp 69-160); 

•i) DNA-mediated gene transfer, by polyethylene glycol-stimulated 
DNA uptake into protoplasts, by electroporation, or by microinjection of 
protoplasts or plant cells (J. Draper. R. Scott, A. Kumar and G. Dury, 
ibid., pp 161-198); 
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iii) transformation using particle bombardment (D. McCabe and P. 
Christou, Plant Cell Tiss. Org. Cult., 3, 227-236 (1993); P. Christou, 
Plant J., 3, 275-281 (1992)). 



5 Agwbacterium-vasAvsXeA transformation is generally ineffective for 
monocotyledonous plants (eg Vanilla), for which approaches ii) and iii) are 
therefore preferred. In all approaches a suitable selection marker, such 
as kanamycin- or herbicide-resistance, is preferred or alternatively a 
screenable marker ("reporter") gene, such as /3-gIucuronidase or luciferase 
io (see J. Draper and R. Scott in D. Grierson (ed.), "Plant Genetic 
Engineering", Blackie, Glasgow and London, 1991, vol. 1 pp 38-81). 

Electroporation is also useful for transforming cells and is well known in 
the art for transforming yeast cell, bacterial cells and plant cells. 

15 

For example, many bacterial species may be transformed by the methods 
described in Luchansky et al (1988) Mol. Microbiol. 2, 637-646 
incorporated herein by reference. The greatest number of transformants 
is consistently recovered following electroporation of the DNA-cell 
20 mixture suspended in 2.5X PEB using 6250V per cm at 25/tFD. 

Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods Enzymol. 194, 182. 

25 Successfully transformed cells, ie cells that contain a DNA construct of 
the present invention, can be identified by well known techniques. For 
example, cells resulting from the introduction of an expression construct 
of the present invention can be grown to produce the polypeptide of the 
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invention. Cells can be" harvested and I yse d and their DNA content 
examined for the presence of the DNA using a method such as that 
described by Southern (1975) J. MoL Biol. 98, 503 or Berent et al (1985) 
Biotech. 3, 208. Alternatively, the presence of the protein in the 
i supernatant can be detected using antibodies as described below. 

In addition to direcUy assaying for the presence of recombinant DNA, 
successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression 
of the protein. For example, cells successfully transformed with an 
expression vector produce proteins displaying appropriate antigenicity. 
Samples of cells suspected of being transformed are harvested and assayed 
for the protein using suitable antibodies. 

Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those ceils, preferably a 
monoclonal (clonally homogeneous) culture, or a culture derived from a 
monoclonal culture, in a nutrient medium; and also, in the case of plant 
cells, a plant derived from, and containing, such cells. 

It is particularly preferred if the host cell comprises a nucleic acid which 
encodes any one of, or combination of, a polypeptide which, in the 
presence of appropriate cofactors if any is capable of catalysing the 
interconversion of rm^-feruloyl SCoA and 4-hydroxy-3- 
methoxyphenyl-^-hydroxypropionyl SCoA (HMPHP SCoA) or a 
polypeptide which, in the presence of appropriate cofactors if any, is 
capable of catalysing the interconversion of 
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4-hydroxy-3-memoxyphenyl-0-hydroxypropionyl SCoA (HMPHP SCoA) 
and vanillin. 

It will be appreciated that the host cells of the invention or an extract 
5 thereof are particularly suited for use in the methods of the invention. It 
is particularly preferred if the host cell does not contain means for 
converting vanillin to a non-vanillin product. 

It is most preferred if the host cell is a plant cell or is comprised in a 
10 whole plant or a bacterial cell or a yeast cell. Preferred bacterial hosts 
include lactic acid bacteria such as Laaococcus spp. and Lactobacillus 
spp. Preferred yeast hosts include Saccharomyces cerevisiae and its 
biovars. It is particularly preferred if the host cell is a food-grade host 
cell (for example a microorganism which is used or can be used in the 
15 food or beverage industry). It is also preferred if the plant is an edible 
plant. 

It will be appreciated that some host cells or host organisms may already 
contain enzyme activities I, II, III or IV and, in that case, it may be 
20 sufficient, in order to use the host cells in the methods of the invention, 
to introduce into said host cell or host organism one or more 
polynucleotides which encode enzyme activities II, III or IV which encode 
those enzyme activities which are deficient in the host cell or host 
organisms. 

25 

In the case of plants for use in the methods of the invention, it is preferred 
that the relevant gene expression is directed to target organs, tissues and 
subcellular organelles where rra/w-feruloyl SCoA, or other appropriate 
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substrates (eg 4-^-coumaroyl SCoA or rro/w-caffeoyl SCoA) for the 
enzymes encoded by the transferred genes, are most readily available. 
The stage at which thioesterification with CoASH occurs in plants, in 
relation to the progressive substitution of the phenyl ring which takes place 
during the conversion from rra^innamate to rro^-ferulate, is unclear 
and may be variable (see R. Whetten and R. Sederoff, The Plant Cell 7 
1001-1013 (1995)). The subcellular localisation or distribution of these 
intermediates during plant phenylpropanoid metabolism also remains 
uncertain; it is likely that they are cytosolic, or that some functional 
organisation of the enzymes which metabolise them occurs. The concept 
of the metabolic cluster or "metabolon", in which there is a degree of 
metabolic channelling and free diffusion is restricted, has been proposed 
and discussed (see R.A. Dixon and N.L. Paiva, The Plant Cell, 7, 1085- 
1097 (1995) and loc cit.). 

Trans-ferula 4-^-c 0 umarate and rr^-caffeate are normal metabolic 
intermediates. Thus, there may be no requirement to manipulate host 
plants in order to provide rra^-feruloyl SCoA. Their concentrations are 
expected to be influenced in varying degrees by physiological requirements 
for a wide range of end-products of the phenylpropanoid pathway, 
including for example lignin, coumarins and flavonoids. There is some 
evidence that the activity of the first enzyme of the phenylpropanoid 
pathway, phenylalanine ammonia lyase (PAL: EC 4.3.1 .5), can influence 
the accumulation of end-products of the pathway (N. Bate, J. Orr, W. Ni 
A. Meromi, T. Nadler-Hassar, P.W. Doemer, R.A. Dixon, C.l. Lamb 
and Y. Elkind, Proc. Natl. Acad. Sci. USA, 91, 7608-7612 (1994)) so 
under certain circumstances it is possible to enhance the metabolic effects 
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of the expression of the genes for enzyme activities II and III by 
increasing the expression of PAL. 



It is well known that gene expression in the phenylpropanoid pathway is 
5 responsive to a range of environmental and stress factors, including 
wounding, chemical elicitors of pathogenic origin, and u/v light. The 
mechanisms regulating these responses are not very well understood, 
though several transcription factors have been identified (see R.A. Dixon 
and N.L. Paiva, The Plant Cell, 7, 1085-1097 (1995)). However, 
10 particularly when these are more fully characterised, they do or will offer 
opportunities for predictable and inducible control of gene expression, 
particular to enhance the provision of substrate. 



Thus, a further aspect of the invention provides a transgenic plant 
15 comprising a polynucleotide according to any of the fourth or fifth aspects 
of the invention. In other words, the transgenic plant is genetically 
engineered to encode and, preferably, express any one or more of enzyme 
activities II or III. It is particularly preferred that, following said genetic 
engineering the plant is able to produce vanillin from tawu-feruloyl SCoA. 
20 It will be appreciated that, depending on the enzymes present in the host 
plant, it may be necessary only to provide a gene encoding only a single 
of said enzyme activities or it may be necessary to provide a gene or 
genes of any two of said enzyme activities. 



25 Conveniently, the transgenic plant is genetically engineered to encode and, 
preferably, express enzyme activities II and III. It can be readily seen that 
the transgenic plants of this aspect of the invention may be used in the 
methods of producing vanillin of the invention, especially when the plant 
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provides the enzyme activity that interconverts /raw-fenilic acid or a salt 
thereof and /ra/w-feruloyl SCoA. 

Preferably the plant is a plant which is readily transformed. Preferably 
the plant is a plant which is commonly used in agriculture or horticulture 
and more preferably the plant is an edible plant. Advantageously the plant 
is a plant in which it is desirable to introduce a vanilla flavour or aroma. 

Preferred plants include those selected from Nicotiana spp., Solarium 
tuberosum, Brassica spp., Capsicum spp., Beta spp. and Vanilla spp. 

As is described in more detail below the plant may be eaten or may be 
processed into a foodstuff or beverage. Thus, conveniently the transgenic 
plant is processed or prepared so that it is not capable of reproduction or 
cultivation, for example the transgenic plant is harvested from the 
environment in which it was grown. 

When vanillin (or the desirable products such asp-hydroxybenzaldehyde) 
is produced in a host cell or organism of the invention, especially if it is 
produced in a transgenic plant of the invention, the vanillin or desirable 
product may initially be present in the form of a glycoside, more 
particularly, a 0-D-glycoside, or, in the case of a carboxylic acid, as 
esters of 0-D-glucose (as occurs in Vanilla pod). In this case, it is 
desirable to release the vanillin (and desirable product) into its uncombmed 
form, for example by acid- or base-catalysed hydrolysis or by the use of 
glycosides such as the 0-D-glucosidase (emulsin; S. Hestrin, D.S. 
Feingold and M. Schramm, Meth. Enzymol. /, 231-257 (1955); see also 
D. Chassagne, C. Bayonore, J. Crouzet and K. Baumes in "Bioflavour 
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95", eds. P. Etievant and P. Schreier, INRA, Paris, pages 217-222 
(1995).). 

In relation to the use of a microorganism such as Ps. fluoresces biovar 
5 V, strain AN103 or of a microorganism which has been genetically 
modified to contain enzyme activities II and in (or at least those of these 
activities that it does not normally have), it is preferred that said 
microorganism is provided with rrow-feruloyl SCoA or a means to 
provide said CoA thioester from /ro/w-ferulic acid or a salt thereof at least 
10 in its culture medium. 

Thus, it can be seen from all of the foregoing description that the 
invention includes biochemical and fermentative processes for producing 
vanillin and vanillic acid, recombinant or transgenic plants and the use of 
15 the said plants in a method of making vanillin or vanillic acid. 

Typically, in a biochemical process the strain of Pseudomonas (eg Ps. 
fluorescent biovar. V, strain AN103) provides an enzyme system for the 
biotransformation of plant derived /rons-ferulic acid to vanillin and/or 

20 related compounds. Enzyme preparations, whole cells of Pseudomonas or 
a heterologous host organism expressing appropriate Pseudomonas genes 
may be used for this. A variety of mutants of Pseudomonas and various 
additional enzyme preparations, co-factors or co-factor regenerating 
systems may be used. The Pseudomonas enzymes might be overexpressed 

25 in a heterologous host before being extracted and used in a 
biotransformation. 
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Alternatively, but suitably, some form of fermentation process may be 
used which involves the Pseudomonas strain or an appropriate derived 
mutant or a heterologous host organism in which the genes for 
biotransformation are expressed. The chosen microorganism is typically 
grown on a ferulate-rich substrate or a substrate comprising rra^-feruloyl 
SCoA. This could generate a vanillin production process. 

In addition the invention includes recombinant microorganisms (eg lactic 
acid bacteria) which are modified to contain genes encoding enzyme 
activities II and 01. For example, lactic acid bacteria modified according 
to the invention may be used to produce vanffla-flavoured yoghurt 
provided that they are supplied with mww-feruloyl SCoA. 

Advantageously, genes for vanillin production (such as those encoding 
enzyme activities II and III) are expressed in a variety of plant species 
such that vanillin accumulates in an appropriate tissue. In this case a new 
crop plant may be cultivated and vanillin would be extracted. Thus, a 
sugar beet plant may be made according to the invention in which the beet 
was rich in vaniHin. In addition the development of a novel plant cultivars 
for direct consumption (eg vanilla-flavoured capsicum), or even for their 
desirable aroma properties, included in the invention. 

The polypeptides of the invention, or the genes which encode them, may 
be used either individually or in combination (whether as substantially 
isolated polypeptides, or in cell-free extracts or as host cells or organisms 
which encode and, preferably, express said polypeptides) to convert a 
compound into a desirable product. Certain compounds and desirable 
products have been described above. However, the invention also 
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includes the production of any other desirable product, such as a flavour 
or aroma, from substrates related to rra/w-ferulate and other known 
substrates of the polypeptides (enzymes) of the invention. Thus, the 
polypeptides or genes of the invention, either individually or in 
5 combination, may be used in processes for converting, for example, 
synthetic substrates of the said polypeptides (enzymes) into novel flavours 
and aromas or they may be used to modify the chemical profile of known 
flavours or aromas. 



10 Similarly, it will be appreciated that certain desirable products can be 
made by the further action of enzyme activity IV upon certain compounds, 
particularly those produced by enzyme activities I, II and m. For 
example, enzymes activities I, II and III may be used to convert trans-4- 
coumaric acid or fra/tf-4-coumaroyl SCoA to p-hydroxybenzaldehyde and 

15 enzyme activity IV may then be used to convert p-hydroxybenzaldehyde 
to p-hydroxy-benzoic acid. Thus, the invention includes a method of 
producing p-hydroxy-benzoic acid using at least one of enzyme activities 
I, II, III and IV and advantageously using all of them. 



20 A further aspect of the invention provides a food or beverage comprising 
a host cell comprising a polynucleotide of the invention, or an extract of 
said host cell. The host cells comprising one or more polynucleotides of 
the invention, especially those such host cells which produce vanillin by 
virtue of the presence of said polynucleotide or polynucleotides, may be 

25 used in the production of food or beverages. In particular, as discussed 
above, lactic acid bacteria which produce vanillin by the methods of the 
invention may be used in the production of cheese, yogurt and related 
products including milk drinks. Similarly, yeasts which produce vanillin 
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by the methods of the invention may be used in the production of food and 
beverages such as bread and beer. Extracts of said host cells may also be 
used in the food or beverage industry. 

A still further aspect of the invention provides a food or beverage 
comprising a transgenic plant comprising a polynucleotide of the 
invention, or a part or extract or said transgenic plant. The transgenic 
plant comprising one or more polynucleotides of the invention, especially 
those such transgenic plants which produce vanillin by virtue of the 
presenceofsaidpolynucleotide or polynucleotides, may constitute the food 
itself or they may be processed to form the food or beverage. For 
example, a tuber of a transgenic potato of the invention consututes a food 
ofthisaspectofthe invention. Alternatively, said transgenic potato may 
be processed into another foodstuff which is, nevertheless, a food of this 
aspect of the invention. 

Still fimher aspects of the invention provide use of Pseutomonas 
fluorescens biovar. V, strain AN103 or a mutam or derivative Hereof in 
a method for producing vanillin, or vanillic acid or sal, .hereof; use of a 
polypeptide of the invention in a method for producmg vanillin, or vanillic 
acid or sal, .hereof; use of a polynucleotide of me invention in a method 
for producing vanillin, or vanillic acid or sal. thereof; and use of a hos, 
cell of the invention in a method for producing vanillin or vanillic acid or 
a salt thereof. 

As is clear from the foregoing the invention also includes a method of 
producing vanillin or vanillic acid, or other related products, the method 
comprising providing /ra^-feruloyl SCoA (or any other suitable CoASH 
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thioester which can be acted upon by enzyme activity II) and providing 
enzyme activity II, enzyme activity III and, in the case of vanillic acid 
production or another related product, enzyme activity IV. 7ra/u-feruloyl 
SCoA is obtainable by the method described in Example 2 or it may be 
5 obtained using the methods of Zenk etal (1980) Anal. Biochem. 101, 182- 
187, incorporated herein by reference. Other CoASH thioesters which 
may be substrates for enzyme II are also described in Zenk et al. 

The preferred method steps and organisms for use in the methods, and the 
foods and beverages of the earlier aspects of the invention are also 
preferred in this method of the invention to the extent that they are 
compatible with this method, and the organisms used in this method. 

As is discussed above, mwiy-feruloyl SCoA (and related CoASH thioesters 
such as 4-tawts-coumaroyl SCoA and /ra^-caffeoyl SCoA) are normal 
metabolic intermediates in plants. Thus, a transgenic plant which 
comprises a polynucleotide or polynucleotides which encode, and 
preferably express, enzyme activities II and III in a location in the plant 
which contains rro/tr-feruloyl SCoA or other suitable CoASH thioester is 
particularly suited for the purposes of this aspect of the invention, as is 
described above, such transgenic plants and products derived therefrom 
form part of the invention. 

The invention will now be described in more detail with reference to the 
following Examples and Figures wherein: 

Figure 1 describes the vanillin pathway in Pseudomonas fluorescens 
biovar. V, strain AN103. HMPHP SCoA is 4-hydroxy-3-methoxy- 
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phenyl-^hydroxypropionyl SCoA. I is an enzyme that catalyses the 
interconversion of /row-ferulic acid and /ro^-feruloyl SCoA; II is an 
enzyme that catalyses the interconversion of rra^-feruloyl SCoA and 
HMPHP SCoA; HI is an enzyme that catalyses the interconversion of 
HMPHP SCoA and vanillin; and IV is an enzyme that catalyses the 
interconversion of vanillin and vanillic acid. 

Figure 2 illustrates the growth of strain AN103 following transfer to MM 
medium cotitaining 10 mM vanillate (V), 10 mM rrow-ferulate (F) or 10 
mM rra^-ferulate plus 10 mM vanillate (FV). Cultures were previously 
grown in MM medium containing 10 mM vanillate. 

Figure 3 indicates the changes in ^-ferulate and vanillate 
concentrations during growth of strain AN103 on MM medium containing 
10 mM /ra/w-feruiate. 



Figure 4 shows the production of vanillin (van) and vanillate (VA) by an 
extract of cells of strain AN103 (165 Mg protein) incubated with 
^-ferulate, ATP, CoASH and Mg- ions, both in the absence of NAD* 
and in its presence (0.5mM). Cells were grown in the presence of 10 mM 
mvw-ferulate, plus 10 mM vanillate. 

Figure 5 demonstrates the formation of feruloyl SCoA, vanillin and acetyl 
SCoA from /r^-ferulate supplied to a PDlO-treated extract of 
/r^-ferulate-grown cells of strain AN103 ( l m protein) in the presence 
of ATP, CoASH and Mg 2+ ions. 
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Figure 6 demonstrates the production of vanillin, acetyl SCoA and feruloyl 
SCoA from HMPHP SCoA supplied to a PDlO-treated cell-free extract 
Ofig protein) of fra/w-ferulate-grown cells of strain AN103. 



5 Figure 7 shows the induction over time of /ranj-ferulate:CoASH ligase 
activity in response to 10 mM rra/u-ferulate (F), 10 mM vanillate (V) and 
10 mM fra/w-ferulate plus 10 mM vanillate (FV) present in MM medium. 
The inocula were grown in MM medium plus 10 mM vanillate; growth 
conditions, enzyme extraction and assay were as described in Examples 
io 1 and 2. 



Figure 8 shows SDS-PAGE of A), an extract of cells grown in MM 
medium with 10 mM /ra/w-feralate, electrophoresed at successive stages 
of purification of the HMPHP SCoA cleavage enzyme; successive stages 
15 are Crude Extract, Mono Q-purified, Mono-P-purified and Phenyl 
Superose-purified, and B), extracts of cells grown in MM medium with 
either lOmM vanillate or lOmM rro/w-ferulate and electrophoresed 
alongside Mono-P-purified cleavage enzyme; A) silver-stained; B) 
Coomassie-stained . 

20 

Figure 9 shows EcoRI/PstI digests of cosmid clones pFT793, pFI794, 
pFI795 and pFI796 separated on an agarose gel. 



Figure 10 shows the sequence of the redundant primers designed from 20 
25 N-teraiinal amino residues of the 31-kDal protein (SEQ ID Nos. 5 and 6). 



WO 97/35999 

PCT/GB97/00809 

51 

Figure 11 shows a Southern blot of EcoRI/PstI digests of various cosmid 
Cones probed with the PGR product amplified using the N-tenninal 
degenerate oligonucleotide primers as shown in Figure 10. 

5 Figure 12 shows the nucleotide sequence of P FI989 (ie the 4370 bp 
EcoRI/PstI fragment from P FI794), together with the succeeding 882bp 
determined from a further subclone, pFI1056 and from pFI794 itself (SEQ 
ID No 7). The amino acid sequence of the 31 kD protein and that 
corresponding to the succeeding open reading frame encoding 
io vanillin.NAD' oxidoreductase (vanillin dehydrogenase) (SEQ ID Nos. 2 
and 4) are also shown. 

Figure 13 shows the nucleotide sequence of pFI901 (ie the 1.8 kb 
EcoRI/PstI fragment from pFI793) (SEQ ID No 8). 



15 



Figure 14 shows the nucleotide sequence of pFI911 (ie the 850 bp 
EcoRI/PstI fragment from pFI793) (SEQ ID No 9). 

Figure 15 shows the nucleotide sequence of P FI912 (ie the 958 bp 
20 EcoRI/PstI fragment from pFI793) (SEQ ID No 10). 

Figure 16 shows the nucleotide sequence of P FT913 (ie the 959 bp 
EcoRI/PstI fragment from pFI793) (SEQ ID No 11). 

25 Figure 17 is a diagrammatic representation of the outward reading primers 
for P FI901 (P35 and P39), pFI911 (P32 and P36), pFI912 (P33 and P37) 
and pFI913 (P34 and P38). 
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Figure 18 is a diagrammatic representation showing the formation of the 
1.5 kb PCR product, using primers P34 and P39, which spans the region 
in the cosmid between the inserts of pFI913 and pFI901. 

5 Figure 19 shows the nucleotide sequence of the merged contigs 
pFI913/PCR product/pFI901 (4259 bp) (SEQ ID No 12). 

Example 1: Isolation and growth nf P seudomnna* fluoresce hi nvar 
V. strain AN1H3 

10 

Experimental 



Growth media 



15 Organisms were grown on the following media: 

Minimal Medium (MM) contained, per 1: KH 2 P0 4 , 5g; (NH 4 ) 2 S0 4 , lg; 

FeS0 4 , 0.5 mg; CaCl 2 , 0.5 mg; MnCl 2 .5H 2 0, 5 mg; (NH 4 ) 6 Mo,0 7 .4H 2 0, 

1.1 mg; MgS0 4 , 5 mg; EDTA, 50 mg; ZnS0 4 .7H 2 0, 28 mg; 

CuS0 4 .5H 2 0, 1.6 mg; CoCl 2 .6H 2 0, 1.6 mg. The pH was 7.0. Carbon 
20 sources were included as indicated. 



Tryptone- and yeast-based medium (LBMod) contained, per 1 : tryptone 
(Bacto; Difco, Detroit, USA), 10 g; yeast extract (Bacto), 5 g; NaCl, 10 
g. The pH was adjusted to 7.5. LB Medium was identical with LB Mod, 
25 with the addition of glucose (lg/1). 

Solid media were prepared with the addition of agar ( Difco), 15 g/l. 
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Isolation of Pseudomonas fluoresces biovar. V, strain AN103 

The organism was isolated from surface soil, on the basis of ability to 
grow on rra^-ferulate as sole carbon source. Initially, a Ig soil sample 
5 was added to 100ml sterile Minimal Medium (MM), containing trans- 
ferulic acid (10 mM). After 2 weeks at 25°C, with shaking at 200 rpm, 
a sample (100 M 1) was removed and added to 200ml of fresh medium 
containing lOmM /ro^-ferulic acid; this was repeated twice. Serial 
dilution onto solid medium (MM) containing lOmM trans-femlzte as sole 
) carbon source enabled isolated colonies to be obtained which were replica- 
plated onto MM plates containing individual substrates as carbon sources. 
Several strains able to use mzw-ferulate as sole carbon source were 
isolated - one (AN103), which was capable of growing also on vanillin, 
was selected for further work. 



Growth of strain AN103 



The organism was grown routinely at 25°C on MM, with shaking, using 
vanillic acid (lOmM) or fra«s-ferulic acid (lOmM) as sole carbon source; 
50ml of medium was used in a 250ml Erlenmeyer flask. Growth was 
monitored by measuring absorbance at 550 or 600nm. 

For long-term storage, bacteria from logarithmic-phase cultures were 
centrifuged and then resuspended in Minimal Medium (MM) containing 
50% glycerol. They were then stored at -70°C. Cultures from these 
frozen stocks were reinitiated by transfer onto LB or LB-MOD solid 
medium, followed by inoculation into liquid medium containing lOmM 
/ra/zs-ferulic acid. 
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Results 

The organism was isolated from soil samples rich in decayed vegetation 
and was shown to be a strain of Pseudomonas fluoresces using standard 
5 identification techniques. As shown in Table I, the bacterium would grow 
not only on /ro/ts-ferulate as sole carbon source, but also on several 
closely-related compounds, including vanillate, protocatechuate and 
caffeate. Growth on vanillin was observed at low concentrations ( < lmM) 
but was variable; higher concentrations were growth-inhibitory. If the 

10 organism was grown on vanillate, transfer to medium containing trans- 
ferulate as sole carbon source was followed by a lag in the growth curve; 
this was not observed if the transfer was to medium containing both 
vanillate and rrow-ferulate (Fig 2). During a growth cycle on 
rroHs-ferulate, a transient increase in vanillate was observed at around the 

15 time when /ra/w-ferulate disappearance was maximal (Fig 3), suggesting 
that vanillate was a catabolite of /ra/w-ferulate. A small amount of 
protocatechuate was also observed when the culture medium was examined 
by TLC (not shown). 
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Table Is Relative growth of ft. fluorescent biovar. V, strain AN103 
on a range of carbon substrates 

Substrates were provided j, MM Median. a t 10 mM concentration 
5 (vanillin, ImM) and relative grown, after 48h a, 25'C was moniwred by 
measuring absorbance at 600 mn. 



10 



15 



20 



25 



Substrates 



Relative Growth (%) 



Ferulic acid 
Caffeic acid 
Sinapic acid 
Cinnamic acid 
Vanillin 
Vanillic acid 
Protocatechuic acid 
Protocatechuic aldehyde 
Glucose 
Acetate 
Methanol 



100 
79 
0 
0 

<100 
140 
77 
0 
221 
47 
0 



Kmn^ l: IWfen,.a . e — ^ - ^ frrr 
mechanism of cleavag e 

Experimental 

Chemicals 



Chemicals and biochemicals were routinely obtained from Sigma Chemical 
Co. Ltd, Poole, Dorset, UK, Aldrich Chemical Co. GiUingham, Dorset, 
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UK or BDH-Merck, Poole, Dorset, UK. The synthesis of CoASH 
thioesters is described below. 

Preparation of 4tydroxy-3-rnethoxyphenyl-&-hydroxypropionyl SCoA 
i (HMPHP SCoA) 

This compound was prepared starting from a Refonnatsky condensation 
of vanillin with ethyl bromoacetate (see R.L. Shriner, The Reformats/? 
Reaction in "Organic Reactions", R. Adams, W.E. Bachmann, L.F. 
Fieser, J.R. Johnson and H.R. Snyder, eds., vol. 1, pp 1-37, John Wiley, 
New York [1942]), followed by purification of the resulting ethyl 4- 
hydroxy-3-methoxyphenyl-j3-hydroxypropionate (ethyl HMPHP) by 
HPLC, hydrolysis to the free acid, A^-succinimidylation and, finally, 
exchange of the JV-succiiiimidyl group with CoASH and isolation of the 
CoASH thioester by preparative TLC (see V. Semler, G. Schmidtberg and 
G.G. Gross, Z. Naturforsch. 42c, 1070-1074 [1987]). 

Vanillin (3g) was mixed with 1.9 ml of ethylbromoacetate and 2g of dry 
Zn dust in 60 ml of dry 1 ,4-dioxane in a round-bottomed flask fitted with 
drying tubes and a reflux condenser. The reaction mixture was heated 
gently to boiling using a heating mantle and refluxed gently for ca. Ih. 
After being allowed to cool, the mixture was acidified with 60 ml of 10% 
H 2 S0 4 and extracted with 4 x 120 ml of diethyl ether. The combined 
ether phases were dried with anhydrous Na 2 S0 4 and unreacted vanillin was 
removed by washing with 3 x 100 ml of sat. K 2 S 2 0 5 . The ether phase 
was then rotary evaporated under vacuum at ca. 30°C to remove the 
ether, leaving a liquid residue (ca.lOml). This was then applied to a 
preparative C-18 reverse-phase HPLC column (Dynamax 60A, 8^m, 



) 
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250mm x 41mm; Rainin, Woburn, MA, USA) and eluted at 12 ml min 1 
with a gradient of MeOH/H 2 0, containing ImM trifluoroacetic acid. 
[Solvent A comprised 40% MeOH/lmM trifluoroacetic acid; solvent B 
comprised 100% MeOH/lmM trifluoroacetic acid; at time = 0 min, 
> solvent was 20% B, rising linearly to 40% B at 28 min and 100% B at 35 
min]. Fractions were monitored by absorbance at 280 nm and material 
eluting between 37 and 45 min was collected. The solvent was removed 
under vacuum at ca. 35°C and the remaining material brought to -20°C 
overnight. The precipitate which formed was then filtered off rapidly and 
freeze-dried to give 300mg of white substance. This was identified as 4- 
hydroxy-3-methoxyphenyl-^hydroxypropiomc acid ethyl ester (ethyl 
HMPHP) by MS [M ]=240 and, on alkaline hydrolysis (1M KOH; 30 
min), gave rise to the free acid. 



To generate the ^succinimidyl ester, 30 mg of ethyl HMPHP was 
hydrolysed for 40 min at room temperature in 0.5 ml of 1M KOH. 
Oxalic acid (0.6 ml of 0.5 M) was then added to bring the pH to ca. 3-4. 
The solution was extracted successively with E^O (5 x ca. 10ml); the 
organic phases were then pooled and evaporated to dryess. N - 
Hydroxysuccinimide (0.1 mmol; 11.5 mg) was then added in 1.2 ml of 
dry 1,4-dioxane. This was then followed, gradually, by 0.1 mmol (20.7 
mg) of dicyclohexylcarbodiimide in 0.6 ml of dry 1,4-dioxane. The 
reaction mixture was allowed to stand at room temperature for ca. 4h and 
then filtered to remove precipitated DCU, a further 1.8 ml of dry dioxane 
being added to wash the filter. 

The tf-succinimidyl ester was not isolated from the reaction mixture but 
was converted in situ into the CoASH thioester. Lithium CoASH (40 mg; 
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ca. 0.05 mmol) was dissolved in 2.4 ml of 0.1M NaHC0 3 and the 
reaction mixture was added; the exchange reaction was performed under 
N 2 , with stirring, for ca. 2h at room temperature. The pH of the mixture 
was then adjusted to ca.3-4 by the addition of 70 M I of 2.8 M HC1 and the 
5 mixture was stored at -70X. The CoASH thioester of HMPHP was 
finally isolated by preparative TLC. Cellulose TLC plates (Avicel; 
1000,1m; Analtech, Newark, DE, USA), to each of which was applied 200 
fil of reaction mixture, were developed in /iBuOH/ HOAc/ H z O (5/2/3, 
v/v/v). The CoASH thioester was localised at R F 0.4 - 0.5 using a short- 
io wave u/v lamp and recovered from the plate by scraping and elution with 
50% MeOH. Identification was confirmed by MS [M =960] and by 
hydrolysis to the free acid (cf. ethyl HMPHP) which was measured by 
HPLC and used routinely as the basis for assay. The CoASH thioester 
showed an absorption maximum at 258 nm, and lacked the absorption 
15 maximum at 345nm characteristic of rra/w-feruloyl SCoA. This molecule 
- and the corresponding ethyl ester and free acid - carry an asymmetric 
centre at the 0-carbon; however, no attempt was made here to resolve the 
optical isomers during or after synthesis. 



20 Preparation of vanilloyl SCoA 



Vanilloyl SCoA was produced from vanillic acid via the A^succinimidyl 
ester, essentially according to the method described by V. Semler, G. 
Schmidtberg and G.G. Gross (Z. Natursforsch. 42c, 1070-1074 [1987]) for 
25 the synthesis of piperoyl SCoA. 

To a stirred solution of vanillic acid (5 mmol) and N - 
hydroxysuccinimide (5 mmol) in 30 ml of dry 1,4-dioxane was added, in 
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small portions, 7.5 mmol of solid dicyclohexylcarbodiimide. The solution 
was stirred overnight at room temperature, and precipitated DCU was 
removed by filtration. The filtrate was evaporated under reduced pressure 
at 40°C and the oily residue dissolved in boiling CHCI 3 . The N- 
5 succinimidyl vanillate was crystallised from solution by the dropwise 
addition of petroleum ether (b.p.: 30°C - 40° C). Approx 750 mg was 
recovered. 



10 



15 



20 



To generate vanilloyl SCoA from N - succimmidyl vanillate, CoASH 
(sodium salt; 200 mg) was dissolved in 4 ml of 0.1 M NaHC0 3 . N - 
Succinimidyl vanillate (120mg in 4 ml of dioxane) was then added 
gradually over a ca.40 min period at room temperature, sparging with N,. 
A further 4 ml of 0. 1M NaHC0 3 was then added, together with a further 
8 ml of dioxane. Incubation under N 2 at room temperature, with stirring, 
was continued for a further lh. The pH was then adjusted to ca. 2.8 with 
1M HC1 and the solution was frozen and stored at -70°C. Isolation of 
vanilloyl SCoA was by preparative TLC, as described above with n 
BuOH/HOAc/H 2 0(5/2/3, v/v/v) as solvent. Vanilloyl SCoA (R F . 0.5-0.6) 
was identified using a short-wave u/v lamp and recovered by scraping, 
elution with 40% MeOH and freeze-drying. Identifkauon was confirmed 
by MS ([M ]=916) and the thioester liberated vanillic acid on alkaline 
hydrolysis. 



Preparation of trans-feruloyl SCoA and other cinnamoyl SCoA thioesters 

7raw-feruloyl SCoA was prepared from rra/w-ferulic acid via the N - 
succinimidyl ester, as described above for vanilloyl SCoA. Final isolation 
was achieved similarly by preparative TLC and elution, identification 
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being confirmed by MS and by alkaline hydrolysis to free trans-kmlic 
acid. Caffeoyl andp-coumaroyl-SCoA thioesters were prepared similarly. 

Preparation of cell-free extracts 



Cell-free extracts of logarithmic-phase cultures (6-10h after inoculation) 
were prepared by sonication. Cells from ca. 200ml of medium were 
pelleted by centriragation, and resuspended in 5-10ml of Extraction Buffer 
(routinely 40mM KPi; pH 7.2, containing lOmM dithiothreitol). They 
were then sonicated (MSE Soniprep 150; Fisons Instruments, Crawley, 
Sussex, UK) at 4°C (5 x 20s; 22 Amplitude microns on full power), and 
centrifuged (20 000 x g; 20 min; 4°C). Extracts were routinely stored 
frozen at -70°C and in some instances buffer-changed using a PD10 
column (Pharmacia) before use. The protein contents of extracts were 
variable - between 0.25 and 1.8 mg/ml. 



Incubation of cell-free extracts 



Cell-free extracts were routinely incubated at 30°C and pH 7.5 in a 
reaction mixture (1ml) containing 90mM Tris HCI buffer and 2.5mM 
MgCl 2 , together with (as appropriate) 0.5mM fra/w-ferulic acid, 0.2mM 
CoASH (Li salt) and 2.5 mM ATP. This complete reaction mixture 
constituted an assay for rrans-ferulate: CoASH ligase, where the initial 
increase in absorbance at 345 nm was monitored against a blank reaction 
mixture from which CoASH was omitted. Incubations with HMPHP 
SCoA (generally 0.4mM) were performed similarly, but with the omission 
of rra/w-ferulic acid, CoASH and ATP. 
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Vanillin: NAD* oxidoreductase was assayed at 30°C and pH 7.0 by 
monitoring the initial decrease in absorbance at 340 nm against a blank 
cuvette from which NAD + was omitted. Because of the similarity in 
extinction coefficient at 340nm for vanillin and for NADH, the sensitivity 
of the assay was increased by catalysing the regeneration of NADH to 
NAD + by providing lactate dehydrogenase and pyruvate. Reaction 
mixtures contained, in 1ml volume, 75 mM KPi buffer, pH 7.0, 0. 125mM 
vanillin, 1 .2 mM Na pyruvate .lactate dehydrogenase (rabbit muscle), 1.1 
U and NAD + , 0.5 mM. 



HPLC analysis 



Metabolites of trans-fenlic acid, including the CoASH thioesters, were 
analysed and quantitated by HPLC using a Lichrosorb RP-18 column (20 
cm x 4.6 mm; Capital HPLC, Broxburn, West Lothian, UK) with a 
multiphasic gradient; solvent A" was 20mM NaOAc, adjusted to pH 6.0 
and solvent B" was MeOH; the flow rate was 1 .2 ml/min; the proportion 
of solvent B" rose linearly from 0% at 0 min to 10% at 15 min and 
thence to 50% at 40 min and 70% at 45 min, finally decreasing to 0% at 
50 min. Detection was with a Spectra Focus detector (Thermo Separation 
Products, Stone, Staffs. UK), which permitted u/v spectral analysis of 
each eluting component. 

Typical approximate retention times were: CoASH, 3 min; vanillic acid, 
7 min; rra/w-feruiic acid, 19 min; acetyl SCoA, 22 min; HMPHP SCoA, 
29 min; vaniiloyl SCoA, 31 min; vanillin, 31.5 min; trans-kruloyl SCoA, 
34 min. 
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Mass spectrometry 



Mass spectra (+-ve and -ve ion) were recorded on a MS 9/50 mass 
spectrometer (Kratos Instruments, Manchester UK), using xenon fast atom 
5 bombardment (FAB) at a potential of 5-7 kV using glycerol as matrix (see 
G.R. Fenwick, J. Eagles and R. Self, Biomedical Mass Spectrometry 10, 
382-386 (1983)). 



Protein assay 

Protein was assayed by the method of M.M. Bradford (Anal. Biochem. 
72, 248-254 (1976)), using Bio-Rad dye reagent (Bio-Rad Laboratories, 
Richmond, CA, USA) and bovine serum albumin as standard. 



15 Results 



Crude extracts of Ps. fluoresces biovar. V, strain AN103, from cells 
grown on vanillate together with rra/zj-ferulate, were able to produce 
vanillate when supplied with /ronj-ferulate, CoASH, ATP, Mg 2+ ions and 
NAD + . In the absence of NAD + , vanillate was not formed and vanillin 
accumulated in its place (Fig 4). The quantity of vanillin formed in the 
absence of NAD + was smaller than the amount of vanillate formed in its 
presence. Essentially no vanillin accumulated in the presence of NAD\ 

This utilisation of rra/w-ferulate by crude extracts was dependent upon 
CoASH and ATP and partially upon Mg 2+ ions (Table II). These 
properties indicate an initial activation of trans-ferulate to tamj-feruloyl 
SCoA by rra/u-feralate: CoASH ligase. This was further shown by the 
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rapid development of a CoASH-dependent absorbance maximum at 345 
nm and particularly by a transient bathochromic shift, causing the 
appearance of a yellow colour, if the reaction mixture was made alkaline 
with NaOH (data not shown). In the initial stages of the overall reaction, 
the linear increase in absorbance at 345 nm enabled the activity of 
rm^-ferulate: CoASH ligase to be assayed directly. During the later 
stages, however, absorbance at 345 nm would be contributed by both 
trans-fcmloyl SCoA and vanillin or, in the presence of NAD + , NADH, 
each of which has substantial absorbance at this wavelength. 

Table n: Cofactor requirements for muw-ferulate utilisation by P. 
fluoresce™ biovar. V, strain AN103 cell-free extracts 

Reaction mixtures (165 M g protein) were incubated for 4h at 30°C as 
described in Experimental, with omissions from the complete reaction 
mixture as indicated. 



Reaction 
Mixture 


Reaction products (nmol) 


Jra/is-ferulate 
(remaining) 


Vanillin 


Vanillate 


Complete 


267 


n.d. 


311 


-CoASH 


550 


n.d. 


40 


-NAD + 


258 


I 228 


23 


-ATP 


513 


n.d. 


33 


-Mg 2 * 


425 


n.d. 


153 



n.d. = not detectable 



An overall non-oxidative cleavage of mro-fcnitoyl SCoA is implied in Fig 
5 which shows, in the absence of NAD + , an equivalence between the 
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formation of vanillin and that of acetyl SCoA. (The formation of [2- l3 C] 
acetyl SCoA from rra/u-ferulate 13 C-labelled in the a-carbon atom was 
also confirmed by NMR spectroscopy (not shown).) The cleavage 
mechanism was investigated further by synthesising chemically the 
5 hydrated derivative of /ra/w-feruloyl SCoA, 4-hydroxy-3-methoxy- 
phenyl-0-hydroxypropionyl SCoA (HMPHP SCoA). This was incubated 
with cell-free extract and shown to be converted rapidly to acetyl SCoA 
and vanillin, in equimolar proportions (Fig 6). A smell of vanillin was 
obtained when HMPHP SCoA was used as a substrate. This hydrated 

10 intermediate was not only metabolised in the forward direction, however, 
since an almost equivalent back reaction to feruloyl SCoA (putatively 
trans) was also observed. The rapidity of utilisation of HMPHP SCoA 
was consistent with the failure to observed its accumulation, using HPLC, 
during cell-free incubations with /ra/w-ferulate, CoASH, ATP and Mg 2+ 

15 ions. This cleavage of HMPHP SCoA in the absence of NAD + indicated 
no intervening /3-oxidation to the 0-keto thioester 
(4-hydroxy-3-methoxybenzoyl) acetyl SCoA {cf. M.H. Zenk, Anal. Z 
Pflanzenphysiol 53, 404-414 (1965)). Attempts to prepare this compound 
for cell-free studies were unsuccessful, but its expected cleavage product, 

20 vanilloyl SCoA, was prepared and shown not to be metabolised to vaniiiin 
by cell-free extracts in the presence of NADH, even when simultaneous 
incubations with rranj-ferulate in the absence of NAD + actively produced 
vanillin. 

25 Besides trans-ferulic acid, caffeic acid and p-coumaric acids were 
converted to thioesters of CoASH by crude extracts of Ps. fluorescens 
biovar. V, strain AN 103 (Table III). 
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Table III. Formation of CoASH thioesters of /r^-hydroxycinnamic 
acids by crude extracts of Ps. fluoresces AN103. The activity was 
assayed spectrophotometrically as described for rra^-ferulate: CoASH 
ligase, measuring the initial rate of increase in absorbance at 345 nm in 
the case of m^-feruloyl SCoA formation, and at the corresponding 
absorbance maxima for the other SCoA thioesters. 



Substrate 


Activity (nkat/mg protein) 


Ferulate 


0.50 


Caffeate 


0.39 



Example 3: Mutants in tmn s-ferul^ mrt a K»i;^ 
Experimental 

Mutagenesis of Pseudomonas fluorescent biovar. V, strain AN 103 

Ethyl methanesulphonate (EMS) was used for mutagenesis. Bacteria were 
grown for 2 d at 25X in minimal medium (MM) with vanillic acid as 
carbon source; 1 ml of culture was then inoculated into 50 ml of LB-MOD 
and grown for 16 h at 25*C. The cells were centrifuged and resuspended 
in 0. 1M KH 2 P0 4 (1 .25 ml) to give a cell density of 4 x!0> cells/ml (OD^ 
of 1.0 = 6 x 10' cells/ml). An aliquot of this cell suspension was serial" 
diluted (10-M0-°) and plated onto LB-MOD plates (0.1 ml per plate) to 
provide control cell counts for assessment of the efficiency of 
mutagenesis. 
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A cell suspension (1 ml) was incubated with 0.08 ml of EMS in a total of 
3 ml of 0.1 M KH 2 PO< at 37°C for 45 min. The cells were then 
precipitated by centrifugation at 4°C and the cell pellet was washed twice 
with 10 ml of LB-MOD medium, prior to resuspension in 1 ml of this 
5 medium. An aliquot was serially diluted (10 2 -10 8 ) and plated onto LB- 
MOD plates; these were incubated at 25°C overnight, together with the 
plates of the unmutagenised cells, to obtain an estimate of kill (70% kill 
indicates efficient mutagenesis). The remaining mutagenised cells (0.9 ml) 
were inoculated into LB-MOD medium (50 ml) and incubated overnight 

10 at 25°C. The mutagenised cells were then enriched for mutants in trans- 
ferulate utilisation by treatment with carbenicillin in minimal medium 
(MM) in the presence of rro/is-ferulic acid. The cells were harvested by 
centrifugation at 4°C, washed twice with MM (10 ml) and resuspended in 
20 ml of MM. A sample (1 ml) was inoculated into MM (15 ml) and 

15 incubated at 25 °C for 1 h; then mww-ferulic acid (lOmM final 
concentration) and carbenicillin (2 mg/ml final concentration) were added. 
A control flask was prepared containing fra/u-ferulic acid, but not 
carbenicillin. Both flasks were incubated overnight at 25 °C for 16 h, 
monitoring OD J80 to estimate growth and confirm the effectiveness of the 

20 antibiotic. Penicillinase (10 units) was then added to destroy the 
carbenicillin, incubating overnight at 25°C. The cells were harvested by 
centrifugation at 4°C, washed twice in MM (10 ml) and resuspended in 
5 ml of MM; 1 ml of these resuspended cells were then inoculated into 50 
ml of MM containing lOmM vanillic acid and incubated at 25°C for ca. 

25 24 h. 

These enriched cells were screened by replica-plating for mutants unable 
to use rrans-ferulic acid. The enriched stock was diluted to 10* , plated 
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onto LB-MOD (0.1 ml per plate), incubated at 25°C for 2 d and then 
replica-plated onto MM containing lOmM vanillic acid or lOmM trans- 
ferulic acid. The plates were incubated at 25°C for 2-3 d and screened for 
colonies able to grow on vanillate but unable to grow on trans- ferutete. 

Results 



By mutagenesis of strain ANI03 with ethyl methane sulphonate, two 
classes of mutants unable to utilise /r^-ferulate as sole carbon source 
were isolated; these were van 1, van 2 and van 3 and, secondly, van 10 
and van 11. Following growth on vanillate plus /row-ferulate, a 
representative of the first of these, van 1, showed no activity in cell-free 
incubations with either rra^-ferulate or vanillin and lacked both 
rra^-ferulate: CoASH ligase and the enzyme that converts vanillin to 
vanillate, vanillin: NAD* oxidoreductase. In contrast, the type 
representative of the second class, designated van 10, possessed levels of 
activity of both ^-ferulate: CoASH ligase and vanillin: NAD* 
oxidoreductase similar to those found in strain AN103, but in the presence 
of NAD + generated very little vanillate (Table IV). Cell-free extracts of 
van 10 were examined further for their ability to metaoolise HMPHP 
SCoA. They metabolised this thioester actively, but appeared 
predominantly to dehydrate it to feruloyl SCoA; vanillin formation was 
substantially inhibited in comparison to the AN103 extract (Table V). 
These observations suggested that van 1 was a. regulatory mutant, 
defective in its induction by rro^-ferulate, whilst van 10 appeared to be 
defective in the HMPHP SCoA cleavage activity. 
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Table IV: 7/uns-ferulate metabolism in cell-free extracts of P. 
fluorescens biovar. V, strain AN103 and of mutant strains van 1 and 
van 10. 

5 Cells were grown for 6h in MM medium containing 10 mM vanillate 
together with 10 mM rraw-ferulate. Extracts were then prepared as in 
Example 2 and assayed for /raw-feralaterCoASH ligase and 
vanillin:NAD + oxidoreductase. Extracts (ca. 0.3 mg of protein) were also 
incubated for 4h in the presence of NAD + to determine relative amounts 

10 of vanillate formed. 



Strain 


Enzyme activity (nkat mg' 1 protein) 


Vanillate formed 


Ferulate: CoASH 
ligase 


Vanillin: NAD 
oxidoreductase 


(nmol mg* 1 protein) 


AN103 


1.7 


1.2 


807 


van 1 


0 


-0.05 


0 


van 10 


2.0 


1.4 


59 



Table V: Utilisation of HMPHP SCoA by extracts of Ps. fluorescent 
biovar. V, strains AN103 and van 10 



20 Extracts (AN103, 14 fig protein; van 10, 68 fig protein) were incubated 
at 30°C for 7 min in 1 ml vol containing 0.3 mM HMPHP SCoA, The 
increase in absorbance at 345 nm was measured against a blank reaction 
mixture containing no extract. Vanillin formation was measured by 
HPLC; the production of feruloyl SCoA was in this instance calculated 

25 from the increase in absorbance at 345 nm, after subtraction of the 
contribution from vanillin. 
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Strain 


AA345 


Feniloyl SCoA 
(nmol) 


Vanillin (nmol) 


AN103 


0.75 


18.4 


23.8 


van 10 


0.63 


29.2 


4.5 



Example 4: Induction of /nwv - frn.l ate mari-is- ;„ ^- ain AN1fn 
and purincation of HM PHP SP» A cleavagP 



Experimental 



Purification of trans-feruhyl SCoA hydrolase faldol cleavage 



enzyme 



Cells (from 2 1 of culture grown for 72 h on MM with lOmM rrow-ferulic 
acid as substrate; =0.5) were extracted essentially as described 

in Example 2 to give 50 ml of crude extract, containing 1.28 mg of 
protein/ml. 



Extract (16 ml, diluted to 40 ml), was applied at room temperature and 2 
ml/min to a Mono Q HR10/10 anion-exchange column (Pharmacia, 
Piscataway, NJ, USA), preequilibrated with 20 mM Tris buffer ( P H 7.5) 
containing 10 mM dithiothreitol. After elution of unadsorbed protein, 
protein bound to the column was eluted with a linear gradient of 
increasing NaCl concentration: from 0 to 0.5M NaCl in 100 ml of buffer. 

Fractions eluting between 0.18 and 0.3 M NaCl and containing activity 
with HMPHP SCoA, as deterrnined using the microtitre plate assay (see 
below), were pooled and buffer-changed by dialysis into 25 mM bis-Tfis 
buffer, containing 10 mM dithiothreitol and adjusted to pH 7.1 with 
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iminodiacetic acid. They were then applied at 0.75 ml/min to a Mono P 
HR 5/20 chromatofocusing column (Pharmacia), preequilibrated with the 
same buffer. After eluting unadsorbed protein from the column, adsorbed 
protein was eluted with a gradient of decreasing pH, generated by 
5 applying 46 ml of 10% (v/v) Polybuffer 74 (Pharmacia), containing 10 
raM dithiothreitol and adjusted to pH 4.0 with iminodiacetic acid. 
Activity with HMPHP SCoA was eluted between pH 5.5 and 5.1. 

The active fractions were again pooled together, and buffer-changed into 
io 20 mM Tris buffer (pH 7.5), containing 1.7 M (NH 4 ) 2 S0 4 and 10 mM 

dithiothreitol, using PD10 columns (Pharmacia), before application at 0.5 

ml/min to a PhenylSuperose HR 5/5 hydrophobic interaction 

chromatography column (Pharmacia) preequilibrated with this buffer. 

Elution of bound protein was achieved with a decreasing gradient of 
15 (NH 4 ) 2 S0 4 in buffer, from 1.7 M to zero over 30 ml and then continuing 

with buffer alone for a further 5 ml. Activity with HMPHP SCoA was 

eluted in this final 5 ml of buffer. 

At each stage of purification, active fractions were detected by a micro- 
20 adaptation of the assay with HMPHP SCoA described above (Example 2); 
reactions were performed in 100^1 of reaction mixture for ca. 4 min at 
room temperature in microtitre wells and absorbance was then measured 
in an MR 5000 microtitre plate reader (Dynatech, Guernsey, Channel 
Islands), equipped with a 340 nm filter. The activity of the pooled 
25 fractions was measured using HPLC to determine the reaction products of 
both HMPHP SCoA, (0.4mM) and fra/w-feruloyl SCoA (0.28mM) as 
substrates. Reaction mixtures containing lO/il of enzyme were incubated 
in 100 jliI volume as described above; the reaction was terminated with 
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100 M l of acidified MeOH (pH 3) after 2 min (HMPHP SCoA) or 5 rain 
(m^-feruloyl SCoA) of incubation at 24X. The proponionality of the 
reactions with time and with quantity of enzyme was established in 
preliminary determinations. 

Samples (lO^f enzyme at each stage of purification were analysed by 
SDS-PAGE, with Coomassie or silver staining, essentially as described by 
H. Schagger and G. von Jagow, Anal. Biochem. 166, 368-379 (1987). An 
Atto AE6450 gel apparatus was used (supplied by Genetic Research 
Instrumentation, Dunmow, Essex, UK). 

Electrocution of protein bands from fixed, stained gels was performed 
using a Bio-Rad Model 422 electroeluter according to the manufacturer's 
directions. Eluted protein was then deposited by centrimgation onto a 
Pro-Spin membrane (Applied Biosystems, Foster City, CA, USA) used in 
accordance with the manufacturer's recommendations). N-Terminal 
sequencing was performed by Alta Bioscience, University of Birmingham, 
Birmingham, UK. 

Results 

The time-course of induction of /r^-ferulate:CoASH ligase in strain 
AN103 is shown in Fig 7. Following transfer of vanillate-grown cells to 
medium containing rro/u-ferulate, the specific activity of the ligase in 
extracts increased approximately linearly over an 8-hour period. An 
essentially identical time-course was obtained if the cells were transferred 
instead to medium containing both rra/w-ferulate and vanillate. Vanillate, 
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a catabolite of taww-ferulate, therefore did not inhibit the induction 
process. 

Induction of the capacity to grow on a different substrate represents a 
5 significant shift in primary metabolism, which in principle might be 
detectable by electrophoresis of a crude protein extract. Cell-free extracts 
analysed by SDS-PAGE, with Coomassie staining (see Experimental), did 
indeed show a distinct difference in protein banding between 
franj-fenilate-grown cells and vanillate-grown cells (Fig 8). Extracts from 
10 rra/w-ferulate-grown cells exhibited a new, or very strongly enhanced, 
band corresponding to a polypeptide of molecular weight ca. 31 kD. 

N-Terminal amino acid sequencing of this polypeptide, following its 
removal from the gel by electroblotting, gave the following sequence: 
15 Ser-Thr-Tyr-Glu-Gly-Arg-Trp-Lys-^^ 
Ile-Ala-Phe (SEQ ID No 13). 

Purification of HMPHP SCoA-utilising activity was achieved by Fast 
Protein Liquid Chromatography (FPLC - Pharmacia). As described 
20 above, fractions were screened for activity with HMPHP SCoA using a 
microtitre plate scanner; active fractions were then pooled together and 
activity was then measured both with HMPHP SCoA and with trans- 
feruloyl SCoA, deterniining the reaction products by HPLC. 

25 The results of the purification are given in Tables VI and VII. Vanillin 
and acetyl SCoA were produced in approximately equimolar amounts, 
throughout the purification, with either HMPHP SCoA or taww-feruloyl 
SCoA as substrate. There was an approximate copurification of the 
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activities with trans - feraloyl SCoA and HMPHP SCoA as substrate, 
including the formation of feruloyl SCoA from HMPHP SCoA 
(dehydratase reaction; reverse of reaction II): vanillin-forming activity 
from rrow-feruloyl SCoA (reactions H+ni) was purified 11.5-fold, 
vanillin-forming activity from HMPHP SCoA (reaction III) was purified 
1 1 . 7-fold and the dehydratase reaction (reverse of reaction H) was purified 
9.1-fold. Approximately 20-25% of each of these activities was finally 
recovered. 



Table VI: Purification of ^-feruloyl SCoA hydratase/aldol cleavage 
enzyme. 

Purification from cells of Ps. fluoresces biovar V, strain AN103, was 
undertaken as described in Experimental, measuring activity - vanillin as 
product - with both rm/w-feruloyl SCoA and HMPHP SCoA as substrates. 
Values in parentheses show activity measuring acetyl SCoA as product. 



Part 1 



| Purification 
stage 


Total activ 


ity (nkat) 


Total 
protein 
(nig) 


Specific activity 
(nkat/mg) 


trans- 

feruloyl 

SCoA 


HMPHP 
SCoA 


irans- 

feruloyl 

SCoA 


HMPHP 
SCoA 


Crude 
Extract 


85.2 
(68.2) 


59.2 
(54.9) 


20.5 


4.16 


2.89 


Mono Q 
Fractions 


80.6 
(66.2) 


45.1 
(42.9) 


4.37 


18.4 


10.3 


Mono P 
Fractions 


56.9 
(51.9) 


35.4 
(33.4) 


1.61 


34.1 


21.2 


Phenyl 

Superose 

Fractions 


22.1 
(19.8) 


15.5 
(151) 


0.46 


48.0 


33.7 
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Part 2 



Purification 
stage 


Ratio of 
activities 


Purification (fold) 


Recovery (%) 


trans- 

feruloyi 

SCoA 


HMPHP 
SCoA 


trtms- 
feruloyl 


HMPHP 
SCoA 


Crude 
Extract 


1.44 


1.00 


1.00 


100 


100 


Mono Q 
Fractions 


1.79 


4.42 


3.56 


94.6 


76.2 


Mono P 
Fractions 


1.61 


8.20 


7.34 


66.8 


59.8 


Phenyl 

Superose 

Fractions 


1.42 


11.5 


11.7 


25.9 


26.2 



15 Table VII: HMPHP SCoA dehydratase activity during purification of 
fro/is-feruloyl SCoA hydratase/aldol cleavage enzyme 



Conditions and other data as Table VI. Dehydratase reaction measured as 
feruloyi SCoA production. 



Purification 
stage 


Total 
activity 
(nkat) 


Specific 
activity 
(nkat/mg) 


Purification 
(fold) 


Ratio to 
HMPHP SCoA 
Cleavage 
Activity* 


Crude 
Extract 


171 


8.34 


1.00 


2.89 


Mono Q 
Fractions 


121 


27.7 


3.32 


2.69 


Mono P 
Fractions 


101 


60.5 


7.25 


2.85 


Phenyl 

Superose 

Fractions 


34.9 


75.9 


9.10 


2.25 



* see Table VI. 
1 
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SDS-PAGE of the combined active fractions at each stage revealed the 
enhancement of a 31 kD protein band (Fig. 8), indicating purification to 
apparent homogeneity after chromatography on PhenylSuperose 
(Pharmacia). This band co-migrated with the band associated with growth 
of strain AN103 on tmns-kmtac and gave the same N-terminal amino- 
acid sequence: Ser-Thr-Tyr-Glu-Gly-Arg-Trp (SEQ ID No 14). 

Definitive proof of the catalysis of both reactions II and III by this protein 
was achieved as a result of expression of the gene in Escherichia coli. (see 
Example 5). 

The Mono-P-purified enzyme was able to accept, as alternative substrates 
to rra«5-feruloyI SCoA, /row-caffeoyl SCoA and ^-4-coumaroyl SCoA 
(Table VIII). 

Table VIII. Utilisation of rm^-p-hydroxycinnamoyl SCoA thioesters by 
/ram-feruloyl SCoA hydratase/aldol cleavage enzyme. Activity was 
determined at 30X and with 0.4 mM substrate using enzyme from Ps. 
fluorescens AN103 (2.8 M g of enzyme protein, partially purified by Mono- 
Q and Mono-P chromatography) as described in Experimental. 



Substrate 


Activity (nkat/mg of protein) 


Feniloyl SCoA 


0.60 


Caffeoyl SCoA 


0.36 


/7-Coumaroyl SCoA 


0.72 
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Example 5: Isola tion of the genes req ui red for the conversion of trans- 
feruloYl SCoA to vanillic acid (vanfflatel in Pseudomonas fluorescent 
strain AN103 

5 A strain of Pseudomonas fluoresces (biovar. V, AN103) was isolated 
from soil at the Institute of Food Research, Norwich Laboratory, which 
was able to grow on rra/ty-ferulic acid converting it to vanillic acid via 
vanillin. The proposed biochemical pathway for the conversion of trans- 
ferulic acid to vanillic acid shown in Fig 1 was substantiated in the 
experiments described above in Experiments 2-4. 

In order to clone the genes required for the conversion of rra/w-feruloyl 
SCoA to vanillic acid the strategy of complementing mutant derivatives of 
Ps. fluorescens AN103 that were unable to grow on fro/u-ferulate as sole 
carbon source was used. The isolation and characterization of mutants is 
described above in Example 3 and mutants vanlO and van 11 were used 
for clone isolation. As described in Example 3, these mutants appeared 
to be defective in a gene involved in the conversion of fra/is-feruloyl 
SCoA to vanillin. 

A genomic library of Ps. fluorescens AN 103 DNA was prepared in the 
mobilisable cosmid cloning vector pLAFR3 (B. Staskawicz, D. Dahlbeck, 
N. Keen, and C. Napoli, 7. Bad. 169, 5789-5794 (1987)). Genomic 
DNA was isolated from Ps. fluorescens AN 103 and partially digested 
with Sau 3A1 at 37°C for 7-10 min. The DNA was then size-fractionated 
on a NaCl gradient (1 ,25-5M). The fraction containing DNA of 20-40 kb 
was selected and 0.5 fig ligated into the dephosphorylated Bam HI site of 
the broad- host-range cosmid cloning vector, pLAFR3. One half of the 
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ligation mix was packaged into bacteriophage lambda particles using a 
Gigapack II XL kit (Stratagene, La Jolla, CA, USA). The packaged 
cosmids were transfected into Escherichia coli strain 803 (W.B. Wood, J. 
Mol. Biol. 16, 118-133 (1966)). Approximately 10,000 primary 
5 transfectants were obtained. The lawn of cells obtained was washed from 
the selection plates and glycerol-containing stocks prepared for storage at 
-70°C. 



) 



The genomic library of Ps. fluoresces AN103 DNA in cosmid pLAFR3 
was introduced into the two mutant Ps. fluoresces derivative strains 
vanlO and vanll using the helper plasmid, pRK20I3 (D. Figurski and 
D.R. Helinski, Proc Natl. Acad. Sci. USA 76, 1648-1652 (1979)). The 
mutant strains were inoculated into minimal medium MM containing 
lOmM vanillic acid and incubated at 25°C for 2 days. The Escherichia 
coli strain carrying the helper plasmid (E.coli 803pRK2013) was 
inoculated into LB-Mod medium (10 ml) and incubated at 37°C for 6 h. 
At the same time, 0.1 ml of the glycerol-containing stock of the AN103 
genomic library was similarly inoculated and incubated. The growth of all 
three cultures was monitored by measuring OD^ and appropriate volumes 
combined in a centrifuge tube to give equal populations of the three 
organisms. The mixture of cells was centrifuged, resuspended in a 
minimal volume of the supernatant solution and spread over a sterile 
gridded cellulose nitrate membrane filter (47 mm diam., Whatman, 
Maidstone, Kent, UK) on a moist LB-Mod agar plate. The suspension 
was allowed to air-dry onto the filter for a few minutes and then incubated 
overnight at 25°C. The bacteria were then washed from the filter using 2 
ml of H 2 0 and aliquots (0. 1 ml) were applied to selection plates consisting 
of MM agar with 10 mM vanillic acid and 5 „g/ml tetracycline. These 
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were incubated at 25 °C for 2 days and the colonies obtained (> 1000 per 
plate) were replica-plated to similar plates containing rra/tr-ferulic acid in 
place of vanillic acid; these were incubated similarly. Colonies (2-3 per 
plate) able to grow on the plates containing rra/iy-ferulic acid were selected 
5 and inoculated into fresh MM medium containing 10 mM trans-femlic 
acid and 5 /xg/ml tetracycline. Four such isolates in which the mutation 
in the Ps. fluoresces strains vanlO and vanll was complemented by the 
introduced cosmid were selected for further analysis. These strains were 
purified and the cosmid DNA was extracted by the mini-preparation 

io method of F.G. Grosveld, H.H.M. Dahl, E. Deboer and R. A. Flavell 
(Gene 13, 227-231 (1981)). The cosmid DNA was transformed into E. 
coli strain 803 and was again isolated as described by D.S. Holmes and 
M. Quigley (Anal. Biochem. 114, 193-197 (1981)). Two of the cosmid 
clones, pFI 793 and pFI 794, were isolated as complementing Ps. 

15 fluoresce™ mutant strain van 10, whereas cosmid clones pFI 795 and pFI 
796 complemented Ps. fluorescens mutant strain van 11. 



To test whether the plasmid clones pFI 793, pFI 794, pFI 795 and pFI 
796 would complement any of the other Ps. fluorescens mutants, each 

20 plasmid was introduced into Ps. fluorescens mutant strains vanl, van2, 
van3, vanlO and vanll. As described above in Example 3 the mutant 
strains vanl, van2 and van3 appear to be defective in a regulatory function 
that eliminates at least two different enzyme activities. The Ps. 
fluorescens van mutants were grown on MMO+ 10 mM vanillic acid agar 

25 medium for two days at 25°C. The strain carrying the helper plasmid 
(803pRK2013) was grown on LB-Mod agar + kanamycin (25 /ig/ml) at 
37°C overnight. The E.coli 803 cosmid clones carrying pFI 793, pFI 
794, pFI 795 and pFI 796 were grown on LB-Mod agar medium + 
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tetracycline (5 ^m.) at 3 >c overnight. Ifc bacteria were patch mated 
Oe a toopful „f donor , recipien[ and ^ ^ ^ ^ 

LB-Mod agar and incubated overnight at 25"C). The bacteria were 
replica plated onto selection medium (MMO + 10 mM mmr-ferulic acid 
5 + tetiacycline and MMO + ,0 mM vaniliic acid + teu.cyc.te) and 
incubated a. 25°C for two days. All four cosmid Cones pFI 793 pFI 
794. P FI 795 and P FI 7% complemented all of the van mutants (vanl 
va„2, van3, vanlO and vanl 1) enabling thetn t0 grow on ,rans- femlaK as 
sole carbon source, 

The cosmid clone DNAs were analyzed by digestion with restriction 
endouucleases Hind HI and Eco RJ to reveal insert DNA. Each of the 
four clones pFI 793. pH 794. pH 795 and pH 796 carried inserts of 
between 20 and 30 M, The four cosmid clones gave distinct restriction 
patterns, but appeared to share some resmction fragments of the same 
*ze. To identify restriction fragments that were common to the cosmid 
clones cosmid pH 793 was used as a probe against DNA of all four 
cosmtd DNA preparations that had been double-digested with restriction 
endonucleases Eco RI and ft, ,. Cosmid DNA was iso.a,ed using Qiag e„ 
mtd, columns according to the manufacarrers insmtctions and wasdigested 
wm restriction erctonucleases EcoXl and ft,I. The resulting fragments 
were sepanKed by agar ose gel electrophoresis and Southern btated ,„ a 
Hyb„nd-N filter as described by Sambrook e, at (Sambrook, J Fitsch 
E.F. and Maniatis. T. MoUcutar Oon ing : A Laboratory Manual, 2nd 
edwon, Cold Spring Harbor, NY, 1989). Cosmid pFI 793 DNA was 
linearised, denatured and labeled with digoxygenin prior to probing the 
Southern blo^ed DNA according to the insmtctions supplied by 
Boehrtnger (Lewes, Sussex, UK). The P FI 793 probe hybridised to al! of 
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the EcoRI/PstI restriction fragments of pFI 795 indicating that these two 
clones are identical. Excluding the vector band at least six EcoRI/PstI 
fragments appeared to be common to pFI 793, pFI 794, pFI 795 and pFI 
796. These were fragments of 6.6kb, 2.9kb, 1.8kb, 1.4kb, 1.25kb and 
1.1 kb. The 1.25kb fragment appeared to be a doublet or triplet. The 
DNA fragment patterns of the four cosmid clones after restriction 
digestion with EcoRI/PstI are shown in Figure 9. For clarity the ethidium 
bromide stained DNA bands have been diagrammatically superimposed on 
the original agarose gel. 



As described above in Example 4 protein analysis of Ps. fluoresces 
biovar. V, strain AN103 showed that cells grown on //ww-ferulate 
contained much larger amounts of a protein of about 31 kD compared with 
cells grown on vanillic acid. The twenty N-terminal amino acids of this 

15 protein were sequenced. This amino acid sequence was then used to 
design degenerate oligonucleotide primers that enabled the 60 bp sequence 
of DNA coding for this N-terminus to be amplified from pFI 793 by PCR 
(Figure 10). This sequence was used to probe EcoRI/PstI digests of the 
cosmid clones. By this technique the fragment containing the region of 

20 DNA encoding the 31 kD protein in each of the cosmid clones could be 
identified (Figure 11). This proved to be a fragment of 6kb in cosmids 
pFI 793 and pFI 795, a fragment of 4.3kb in cosmid pFI 794 and a 
fragment of 5.5kb in cosmid pFI 796. 

25 The 4.3 Kb EcoRI/PstI fragment of pFI 794 was sub-cloned into the E. coli 
vector pUC19 using strain XLI(Blue) and its nucleotide sequence was 
determined using an Applied Biosystems DNA Sequencer (Model 373: 
Perkin Elmer, Warrington, UK); together with the manufacturer's Taq 
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DyeDeoxy Terminator Cycle sequencing kit. A primer walking strategy 
was used with oligonucleotide primers being synthesized on an ABI 392 
Synthesizer (Perkin Elmer, Warrington, UK). The DNA sequence of the 
4.3kb fragment is presented in Figure 12. The open reading frame 
5 encoding the same airuno-terminus as determined previously starts at 
position 2872 and is terminated by a stop codon at position 3700. This 
ORF of 828 bp encodes a protein 276 amino acids long with a molecular 
size of 31.010 kD in good agreement with the protein gel analysis. The 
translated ammo-acid sequence of this gene is also presented in Figure 12. 
) In order to confirm the function of this gene it was sub-cloned and 
expressed in E. coli. From the DNA sequence PCR primers were 
designed to amplify the gene such that it was flanked by restriction 
endonuclease sites EcoKl and BamHl. The amplified gene retained its 
native ribosome binding site being initiated at base -29 and ending 6bp 
downstream of the stop codon. The amplified fragment was cloned into 
the equivalent sites of the E. coli expression vector pSP72 (Promega, 
Southampton, UK) and transformed into E. coli JM109(DE3). 

The E. coli 803 clones carrying the hydratase/cleavage enzyme gene, plus 
a putative promoter, as a PCR product in the vector P RK415 were used 
in a triparental patch mating experiment essentially as described earlier in 
relation to complementation by the cosmid clones pFI 793-6. The 
complemented van 10 strain was demonstrated to have recovered the 
ability to grow on rr^-ferulic acid, confirming directly that the mutation 
in van 10 resided in the gene encoding the rronMeruIoyl SCoA 
hydratase/cleavage enzyme. 
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The presence of a novel enzyme activity (c/. Example 4) in the E. coli 
clone was demonstrated. E. coli cells were grown at 37°C for 3 h in 
50ml of L medium, containing ampicillin (50 M g/ml) with and without 
induction by IPTG. Extracts were prepared as described above in 
Example 2 for Ps. fluoresces AN103, but without centrifugation. The 
crude extract was used for assay. Enzyme activity with both HMPHP 
SCoA and mww-feruloyl ScoA was determined as described above in 
Example 4 using HPLC to determine the reaction products. The results 
presented in Table IX clearly demonstrate that vanillin and acetyl SCoA 
were produced in equimolar proportions both with /ra/w-feruloyl SCoA 
and with HMPHP SCoA as substrates. In addition, HMPHP SCoA was 
also dehydrated to feruloyl SCoA, putatively tawis-feruloyl SCoA, 
demonstrating the reverse of activity II. These results are closely similar 
to those obtained with the vanillin-forming cleavage enzyme purified from 
Pseudomonas fluoresce™ AN103, although the ratio of activities with 
trans - feruloyl SCoA and HMPHP SCoA differs slightly between the two 
preparations. There was no activity with either tawj-feruloyl SCoA or 
HMPHP SCoA in extracts of an unmanipulated E. coli strain, whether 
induced or not. In the manipulated strain E. coli 1039 that expresses the 
Pseudomonas gene the specific activity was slightly lower in the e AU act 
made following induction than in that made from uninduced bacteria. 
Since the assay measures only active enzyme it is conceivable that 
increased protein expression occurs upon induction but this may result in 
incorrectly folded and therefore inactive enzyme. It was not possible to 
detect expression of the 31kD protein visually on Coomassie-stained, one- 
dimensional SDS gels because of its co-migration with the strongly 
expressed /^-lactamase encoded by die vector ampicillin resistance marker. 
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The DNA downstream of the gene encoding the 3 lkD protein was targeted 
for cloning and sequencing and for analysis of additional open reading 
frames. A PCR-generated probe was used to identify an overlapping Xhol 
fragment of 1.5 kb. Sequencing from this fragment and subsequently 
5 directly from the parent cosmid clone pFI794 revealed a second open 
reading frame of 1449 bp beginning at base 3804. The translation of this 
nucleotide sequence revealed a polypeptide of 483 amino acids. 
Comparison with sequences in the databases revealed appreciable 
homology to salicylaldehyde: NAD + oxidoreductase. 

10 

In order to confirm the function of this gene, expression was determined 
in E. coli strain DH5, which contained the vector pUC18 into which the 
full-length open reading frame had been inserted such that expression was 
from the lac promoter on the vector. Vanillin: NAD + oxidoreductase 
15 activity was confirmed and was absent from a control strain bearing the 
unmodified pUC 18 vector. Using the enzyme assay described in 
Example 2, activity with vanillin as substrate was determined as 3.0 
nkat/mg of protein; activity with salicylaldehyde was 2.8 nkat/mg. 

20 Additional sequence analysis of DNA cloned from Ps. fluorescens AN 103 
was undertaken using cosmid clone pFI 793. The 1.8, 0.9 and 0.8 kb 
EcoRI/PstI fragments were sub-cloned into E. coli vector pUC18 and their 
nucleotide sequences were determined. Sequencing the 0.9 kb sub-clones 
revealed that there are two different fragments of the same size. The 

25 nucleotide sequences of DNA fragments of 1837bp, 960bp, 959bp and 
854bp in sub-clones pFI 901, pFI 912, pFI 913 and pFI 91 1 respectively 
are presented in Figures 13 to 16. Outward reading PCR primers were 
designed from the ends of each of the four sequences as shown in Figure 
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17. Use of these primers in all possible pairwise combination with pFI 
793 as template showed that the 1.8 kb fragment of pH 901 was separated 
from the 959b P fragment of pFI 913 by 1.5 kb on the cosmid DNA 
(Figure 18). Direct sequence analysis of this 1 .5 kb PCR product enabled 
this together with the 1.8 kb and 959bp fragments to be merged into one 
larger fragment of 4.3 kb (Figure 19). 

Example 6: Production of vanillin fmm /™„c_ fe r ulnv» SPnA anA 
enzyme activities IT and m 

W-feruloyl SCoA was synthesised as described in Example 2, and was 
used as a substrate of the /ro/ir-feruloyl SCoA hydratase/aldol cleavage 
enzyme (ie a single polypeptide with enzyme activities II and III) as 
purified by the method described in Example 4. Vanillin was produced 
from rr<a/w-feruloyl SCoA. 

Example 7; A transgenic tobacco nlant W M c h prnrf.,r~ v a „m;„ 

Nicotiana tabacum (tobacco) is transformed using a strain of 
Agrobacterium tumefaciens which has been modified so mat it transfers the 
Ps. fluoresces gene encoding enzyme activities II and III (see Example 
5) to the tobacco plant. The tobacco plant produces vanillin in those parts 
of the plant which have mwj-feruloyl SCoA present, at least in the form 
of a vanillin glycoside. 



WO 97/35999 



PCT/GB97/00809 



86 

CLAIMS 

1. A method of producing vanillin comprising the steps of 

5 (1) providing mwu-ferulic acid or a salt thereof; and 

(2) providing fronj-feruIate:CoASH ligase activity (enzyme 
activity I), frany-feruloyl SCoA hydratase activity (enzyme 
activity II), and 4-hydroxy-3-methoxyphenyl-j8-hydroxy- 
propionyl-S-CoA (HMPHPSCoA) cleavage activity (enzyme 
10 activity III). 

2. A method according to Claim 1 wherein means for converting 
vanillin to a non-vanillin product is absent or reduced. 

15 3. A method according to Claim 1 wherein the enzyme activities I, II 
and III are provided by Pseudomonas fluoresceins biovar. V, strain 
AN 103 as deposited under the Budapest Treaty at the National 
Collections of Industrial and Marine Bacteria Limited, Scotland 
under Accession No NCIMB 40783, or a mutant or variant thereof. 

20 

4. A method according to any one of Claims 1 to 3 wherein the 
enzyme activities I, II and III are provided by an intact-cell-free 
system of Pseudomonas fluoresce™ biovar. V, strain AN103 as 
deposited under the Budapest Treaty at the National Collections of 
25 Industrial and Marine Bacteria Limited, Scotland under Accession 

No NCIMB 40783, or a mutant or variant thereof. 
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5. A method according to Claim 2 wherein means for convening 
vanillin to a non-vanillin product is an activity that interconverts 
vanillin and vanillic acid (enzyme activity IV). 

6. A method according to Claim 5 wherein NAD + is absent. 

7. A method according to Claim 1 or 2 further comprising the step of 



12. 



(3) providing any one of the cofoctors Coenzyme ASH, ATP 
Mg 2+ , or other functionally equivalent cofactors. 



8. A method according to Claim 7 wherein either one of the cofactors 
Coenzyme ASH and ATP is recycled or generated. 

9. A method according to Claim 8 wherein Coenzyme ASH is 
recycled using the enzymes citrate synthase and citrate lyase. 

10. A method according to Claim 8 wherein ATP is generated using 
the enzymes- adenylate kinase and acetate kinase. 

11. A method according to any one of Claims 1 to 10 wherein the 
/ra/w-femlic acid or salt thereof is provided by action of trans- 
ferulic acid esterase on plant material, said plant material 
containing an ester of //ww-fenilic acid. 



A method according to any one of Claims 1 to 1 1 wherein at least 
one of the enzyme activities II or III is provided by a substantially 
purified enzyme. 
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13. A method according to any one of Claims 1 to 12 further 
comprising the step of providing a compound other than trans- 
ferulic acid or a salt thereof which may be converted by any one of 
enzyme activities I, II or III into a desirable product. 

5 

14. A method according to Claim 13 wherein said compound is any of 
/ra/w-4-coumaric acid or a salt thereof, frans-4-coumaroyl S CoA, 
rra/zs-caffeic acid or a salt thereof, fro/w-caffeoyl SCoA or 3,4- 
methylene dioxy-rra/w-cinnamic acid or a salt thereof. 

10 

15 . A method according to Claim 14 wherein said compound is transA- 
coumaric acid or a salt thereof or frans-4-coumaroyl SCoA. 



16. A method of producing vanillic acid, or a salt thereof, comprising 
15 the steps as defined in any one of Claims 1, 3, 4 and 7 to 12 and 

the further step of providing enzyme activity IV. 

17. A method according to any one of Claims 1 to 12 comprising the 
further step of separating vanillin from the other reaction 

20 components. 



18. A method according to Claim 16 comprising the further step of 
separating vanillic acid, or a salt thereof, from the other reaction 
components. 

25 

19. Pseudomonas fluorescens biovar V, strain AN103 as deposited 
under the Budapest Treaty at the National Collections of Industrial 
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and Marine Bacteria Limited, Scotland under Accession No 
NCIMB 40783, or a mutant or variant thereof. 

20. A mutant of Pseudomonas fluoresces biovar V, strain AN103 

according to Claim 18 which accumulates vanillin when provided 
with rra/w-ferulic acid or a salt thereof. 



A polypeptide which, in the presence of appropriate cofactors if 
any, is capable of catalysing the interconversion of mvw-femloyl 

SCoAand^hyoroxy-S-methoxyphenyl-iS-hydroxy-propionylSCoA 
(HMPHP SCoA). 



A polypeptide according to Claim 21 which is a rro/w-feruloyl S 
CoA hydratase. 

A polypeptide which, in the presence of appropriate cofactors if 
any, is capable of catalysing the interconversion of 4-hydroxy-3- 
methoxyphenyI-/3-hydroxy-propionyl SCoA (HMPHP SCoA) and 
vanillin. 



A polypeptide according to Claim 23 which is a HMPHP SCoA 
cleavage enzyme. 

A polypeptide which, in the presence of appropriate cofactors if 
any, is capable of catalysing the interconversion of /ro/u-feruloyl- 
S-CoA and vanillin via HMPHP SCoA. 
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A polypeptide according to any one of Claims 21, 23 and 25 
comprising the amino acid sequence 

MetSerThrTyrGluGlyArgTrpLysThrValLysValGluIleGluAspGlylleAla 
PheVall leLeuAsnArgProGluL^ 

MetlleAspValLeuGluThrl^uGluGlnAspProAlaAlaGlyValLeuValLeuThr 
GlyAlaGlyGluAlaTrpThrAlaGl^ 

AlaGlyProGluI leLeuGlnGluLys I leArgArgGluAlaSerGlnTrpGlnTrpLys 
LeuI^uArgMetTyrAlaLysProThrlle^ 

GlyPheSerProLeuValAlaCysAspI^uMalleCysAlaAspGluAlaThrPheGly 
I^uSerGluIleABnTrpGlylleProPr^ 

ThrValGlyHisArgGlnSerLeuTyrT^ 

LysAlaAlaGluMetGlyLeuValAanGluSerValProLeuAlaGlnLeuArgGluVal 

ThrlleGluI^uAlaArgAsnLeuLeuGluLysAsnProValVall^uArgM 

HisGlyPheLysArg^sArgGluI^uThr^ 

LysLeuAspGlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMetLysGln 
PheLexiAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArg 

(SEQ ID No 2) or a fragment or variant thereof. 

A vanillin: NAD + oxidoreductase comprising the amino acid 
sequence 

Metl^uAspValProLeuI^uIleGlyGlyGlnSerCysProAlaArgAspGlyArgThr 

PheGlxiArgArgAsnProValThrGlyGluLeuValSerArgValAlaAlaAlaThrLeu 

GliiAspAlaAspAlaMaValAlaAlaAlaGlnGlnAlaPheProAlaTrpAlaAlaLeu 

AlaProAsnGluArgArgSerArgLeuteuLys^ 

GlyGluPhelleGluAlaAlaGlyGluThrGlyAl^ 

ValArgLeuAlaAlaAsnMetLeuArgGluAlaLAlaSerMetThrThrGlnValAsnGly 
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GluVallleProSerAspValProGlySerPheAlaMetAlaLeuArgGlnProCysGly 

ValValLeuGlylleAlaProTipAsnAlaProVallleLeuAlaThrArgAlalleAla 

MetProLeuAlaCysGlyAsnThrValValLeuLysAlaSerGluLeuSerProAlaVal 

HisArgLeuIleGlyGlnValLeuGlnAspAlaGlyLeuGlyAspGlyValValAsnVal 

IleSerAsnAlaProAlaAspAlaAlaGlnlleValGluArgLeuIleAlaAsnProAla 

ValArgArgValAsnPheThrGlySerThrHisValGlyArglleValGlyGluLeuSer 

AlaArgHisLeuLysProAlaLeuLeuGluLeuGlyGlyLysAlaProLeuLeuValLeu 

AspAspAlaAspLeuGluAlaAlaValGlnAlaAlaAlaPheGlyAlaTyrPheAsnGln 

GlyGlnlleCysMetSerThrGluArgLeuIleValAspAlaLysValAlaAspAlaPhe 

ValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspProAlaAspProGlu 

SerValLeuGlySerLeuValAspAlaSerAlaGlyThrArglleLyaAlaLeuIleAsp 

AspAlaValAlaLysGlyAlaArgLeuVallleGlyGlyGlnLeuGluGlySerlleLeu 

GlnProThrLeuLeuAspGlyValAspAlaSerMetArgLeuTyrArgGluGluSerPhe 

GlyProValAlaValValLeuArgGlyGluGlyGluGluAlaLeuLeuGlnLeuAlaAsn 

AspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGlyArgAlaLeuAla 

LeuAlaGlnArgValGluSerGlylleCysHisIleAsnGlyProThrValHisAspGlu 

AlaGlnMetProPheGlyGlyValLysSerSerGlyTyrGlySerPheGlyGlyLysAla 

SerHeGluHisPheThrGlnLeuArgTrpValThrLeuGlnAsnGlyProArgHisTyr 
Prolle 

(SEQ ID No 4) or a fragment or variant thereof. 

A polypeptide as defined in any one of Claims 21 to 27 which is 
substantially pure. 



A polynucleotide encoding a polypeptide as defined in any one of 
Claims 21 to 27. 
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A polynucleotide comprising at least a part of the Pseudomonas 
fluoresces DNA contained within the cosmid clone pFI 793 as 
deposited under the Budapest Treaty at the National Collections of 
Industrial and Marine Bacteria Limited, Scotland under Accession 
No NCIMB 40777, or a fragment or variant thereof. 

Cosmid pFI 793 as deposited under the Budapest Treaty at the 
National Collections of Industrial and Marine Bacteria Limited, 
Scotland under Accession No NCIMB 40779. 

A polynucleotide according to Claim 30 encoding a polypeptide as 
defined in any one of Claims 21 to 27. 

A polynucleotide according to any one of Claims 29, 30 or 32 
comprising the nucleotide sequence 

ATGAGCACATACGAAGGTCGCTGGAAAACGGTCAAGGTCGAAATCX3AAGACGGCATCGCG 

TTTGTCATCCTCAATCGCCCGGAAAAACGCAACGCGATGAGCCCGACCCTGAACCGCGA 

ATGATCGATGTTCTGGAAACCCTCGAGCAGGACCCTGCCGCCGGTGTGCTGGTGCTGACC 

GGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCCGCGAAGTGGAC 

GCCGGCCCGGAAATCCTCCAGGAAAAAATCCGCCGCGAAGCCTCGCAATGGCAATGGAAA 

CTGCTGCGCATGTACGCCAAGCCGACCATCGCCATGGTCAATOSCTGGTGm 

GGTTTCAGCCCGCTGGTGGCCTGCGACCTGGCGATCTGCGCCGACGAAGCAACCTTCGGT 
CTCTCGGAAATCJ^CTCKJGGTATrc 

ACCGTGGGCCACCGCCAGTCGCTCTACTACATCATGACCGGCAAGACCTrCGGTGGGCAG 
AAAGCCGCCGAGATGGGCCTGGTCAACGAAAGCGTGCCCCTGGCGCAACTGCGCGAAGTC 
ACCATCGAGCTGGCGCGTAACCTGCTCGAAAAAAACCCGGTGGTGCTGCGTGCCX3CCAAA 
CACGGTTTCAAACGCTGCCGCGAACTGACCTGGGAGCAGAACGAGGATTACCTGTACGCC 
AAGCTCGATCAGTCGCGTTTGCTGGACACCGAAGGCGGTC^CGAGC^GGGCATGAAGCAA 
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TTCCTCGACGACAflGAGCATCAAGCCTGGCCTGCAAGCGTATAAACGC 

(SEQ ID No 1) or a fragment or variant thereof. 

A polynucleotide according to any one of Claims 29, 30 or 32 
comprising the nucleotide sequence 

ATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGT^ 
TTCGAGCX3CCGCAACCCGGTGACTCGCGAGTTGGTGTC 

GAAGATGCCGACGCCX^CGTGGCCGCTGCCCAGCAAGCXiaTTCCCGCGTGQGCCG 
GCGCCCAAT^CGGO^GCCCnTTGCTCAA^CC^^ 

GGCGAGTTCATCGAGGCGGCGGGCGAGACCGGCGCCATGGCCAACTG 

GTACGGCTGGCXJGCCAACATGCTGCGTGAAGCGGCATCGATGA^ 

GAAGTGATTCCCTCGGACGTTCCCGGCAGTITCGCCATGGCCCTGC^ 
GTGGTGCIX^TCGCCCCCT^C^ 

ATCCCGCTGGCCTGTCGO^CACCGTGGTGCTCAAGGCTTrc 

CATCGCTTGATCGGCC^GGTGCTGCAGGACGCCGGCCTGGGCGATGGCC 

ATCAGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGATTGCC^ 

GTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCGGGCGCATTGTCGGCGAGCT 
GCGCGCCACCTCAAACCGGCXrTTGCTCXiAGCTGGGCGGCAAG 

GACGATGCCGACCTGGAGGCTGCCGTGCAGGCGGCGGCGTTTGGCGC^ 

GGACAGATCTGTATGTTCCACCGAGCGCCTGATTGTCGATGCCAAGGTGGCCGACGC 
CrrCGCCCAGTTGGCGGCCAAGGTC^GACCCTGCGCGCO^ 

TCGGTGCTCGGTTCGCTGGTGGACGCCAGCGCTGGCACGCGGATCTi^^ 

GATGCCGTGGCCAAGGGCGCGCGCCTGGTAATCGGQSGGCAACTGG^ 

cagccgaccctgctcgacggtgtc^cgcgac^tgcg™ 

GGCCCGGTGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCTGTTGCAAC^^ 
^CTCC^GTTCGGTTTGTCGGC^CGATTTTCAGTO^CACCG^ 

CTGGCCCAGCGGGTCGAATCGGGCATCTGCCACATCAACGGCCCGACCGTGCACGACGAA 
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CKTGCAAATGCCTTTTGGCGGG^ 

TCC^TTGAGCATTTCACT^ 

CCGATC 

5 (SEQ ID No 3) or a fragment or variant thereof. 

35 . A nucleic acid vector comprising a polynucleotide according to any 
one of Claims 28 to 34. 

10 36. A host cell comprising a polynucleotide as defined in any one of 
Claims 28 to 34. 

37. A host cell according to Claim 36 comprising nucleic acid which 
encodes any one of the polypeptides as defined in Claims 21, 23 

15 and 27. 

38. A host cell according to Claim 36 or 37 which is a bacterium or 
yeast. 

20 39. A host cell according to Claim 38 which is a food-grade bacterium 
or yeast. 

40. A host cell according to Claim 39 which is a Lactococcus spp. or 
a Lactobacillus spp. or Saccharomyces cerevisiae or a biovar 
25 thereof. 



41. 



A host cell according to Claim 37 which is a plant cell. 



WO 97/35999 „ 

PCT/GB97/00809 



42. 



95 

A host cell according to Claim 41 wherein said plant cell is a cell 
from any one of Nicotiana spp. , Solanum tuberosum, Brassica spp. , 
Beta spp.. Capsicum spp. and Vanilla spp. 



43. A host cell according to Claim 41 or 42 wherein said cell is 
comprised in a plant. 



44. 



45. 



46. 



47. 



48. 



A transgenic plant comprising at least one polynucleotide according 
to Claim 29 and which, as a consequence of the presence of said 
polynucleotide, expresses any of the enzyme activities II and m. 

A transgenic plant according to Claim 45 which, as a consequence 
of the presence of said polynucleotides, expresses the enzyme 
activities II and III. 

A transgenic plant according to Claim 44 or 45 wherein said plant 
is selected from Nicotiana spp. , Solanum tuberosum, Brassica spp. , 
Beta spp., Capsicum spp. and Vanilla spp. 

A method according to Claims 1 to 18 wherein the enzyme 
activities II and III are provided by the host cell according to any 
of Claims 36 to 43 or a transgenic plant according to any one of 
Claims 44 to 46, or an extract thereof. 

A method according to Claims 1 to 18 wherein the enzyme 
activities I, II and HI are provided by a microorganism which can 
convert trans-ferulfc acid to vanillin. 
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49. Use of Pseudomonas fluorescens biovar. V, strain AN103 or a 
mutant or derivative thereof for producing vanillin, or vanillic acid 
or salt thereof. 

5 50. Use of a polypeptide according to any one of Claims 21 to 28 for 
producing vanillin, or vanillic acid or salt thereof. 

51. Use of a polynucleotide according to any one of Claims 29 to 35 
for producing vanillin, or vanillic acid or salt thereof, 

10 

52. Use of a host cell according to any one of Claims 36 to 43 for 
producing vanillin, or vanillic acid or salt thereof. 

53. Use of a transgenic plant according to any one of Claims 44 to 46 
15 for producing vanillin, or vanillic acid or salt thereof. 

54. Use of Pseudomonas fluorescens biovar V, strain AN103 or a 
mutant or derivative thereof for converting a compound, other than 
rraws-ferulic acid or a salt thereof, into a desirable product. 

20 

55. Use of a polypeptide according to any one of Claims 21 to 28 for 
converting a compound into a desirable product. 

56. Use of a polynucleotide according to any one of Claims 29 to 35 
25 for converting a compound into a desirable product. 



57. 



Use of a host cell according to any one of Claims 36 to 43 for 
converting a compound into a desirable product. 
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58. Use of a transgenic plant according to any one of Claims 44 to 46 
for converting a compound, other than trans-fendk acid or a salt 
thereof, into a desirable product. 

59. Use according to any one of Claims 54 to 58 wherein said desirable 
product is a flavour or aroma. 

60. Use according to Claims 54 to 59 wherein said compound is any 
one of/ro/w-4-coumaroyl S CoA and mro-caffeoyl SCoA. 

61 . A food or beverage comprising a host cell according to any one of 
Claims 36 to 43, or an extract thereof. 

62. A food or beverage comprising a transgenic plant according to 
Claim 44 to 46, or a part or extract thereof. 

63 . A method of producing vanillin comprising 

(1) providing fra/w-feruloyl SCoA; and 

(2) providing mra-feruloyl SCoA hydratase activity (enzyme 
activity II), and HMPHP SCoA clevage activity (enzyme 
activity III). 

64. Any novel method of producing vanillin or vanillic acid 
substantially as disclosed herein. 



Any novel polypeptide substantially as disclosed herein 
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66. Any novel polynucleotide substantially as disclosed herein. 
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Vanillin Pathway in Pseudomonas fluorescent, biovar V, strain AN103 
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Figure 7 
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Fig 8. 




97.4 kDa Phosphorylase b 
66.2 kDa Bovine serum albumin 
45.5. kDa Hen egg white ovalbumin 
31 .0 kDa Bovine carbonic anhydrase 

21 .5 kDa Soybean trypsin inhibitor 
1 4.4. kDa Hen egg white lysozyme 
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EcoRI/Pstl digests of cosmid clones 
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Figure 10 
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Nucleotide and amino acid sequence of the cleav age enzvme and vanillin dehydrogenase 
Sequence determined from sub-cloned fragments pFI9RQ and pFllQSfi and from the cnsmiri 

clone pFI794 



1 


GAATTCTCTCGCGCTTTGCCCAGTCCTACCCGCTGGTGCAGATCGAGGTG 


50 


51 


CATTGCGAGTCGTCCAAGCAACTTCTCTTGCGCCAGGACCTCGACCTGTC 


100 


101 


CATCGTGACCCGCGAACCCGGCAACGAAATCGGCCAGCTGTTGCGCAAGG 


150 


151 


AGCGTTTTGTCTGGGCCCAGGCCCAGTGCTACAACCCTGTCGAGCAATCA 


200 


201 


CCCTTGCCGCTGGCGATGTTCAACAGTGACTGCTTOTGCCGTCTTTGGGC 


250 


251 


CTGTAATGCGCTGGATGCCGCCGGACGTGAATACCGCATCGCCTACAACA 


300 


301 


GTTCGAGCCTGTCGGCGCTGATGGCAGTGGTCAGCGCCGGGCTGGCGGTG 


350 


351 


ACCGCTCAATTGGAAAGCCTGATCCCGCAGGACATGCGCATCCTCGGCGA 


400 


401 


GGCCGAAGGCCTGCCCCAACTGCCCGAGGCGAGCATCATGCTGCTGCGCA 


450 


451 


ATCTGCATAATCCGTCGCCGATTACCGAATGTCTGGCGGAGCACATCGTC 


500 


501 


GAAGGCTTCAAACTTTAAAGGCGAGCATCACCGCGCAGAGCACCAGAAAA 


550 


551 


CCGCAGAACAAACCGCGTAGTAGCCGCTCCGGCAGGGCGTGGGCAATCTT 


600 


601 


CACGCCCCAACTGATACTGAGCAGACCGCCCACCGCCATGGGCAACGCGA 


650 


651 


TGTGCCAGTCGACTTGCTGGTGCAGGGCGTAGGTCGCCAGGGTTACACCG 


700 


701 


GTGCTGGGTAAAGCCAGTGCCAACGACAGGCCCTGGGCCACCACCTGGGT 


750 


751 


GGTGCCAAACAAGCTGGTCAATACCGGCGTTGCGACCACAGCGCCGCCCA 


800 


801 


CCCCGAACAAGCCACCCATGACGCCGGACGCCGCGCCCAGCACCCCGAGC 


850 


851 


CAGGGCCACGAATAGCGCATCTGCGCAGTCGGTGCCGCCGCAGTCATGAA 


900 


901 


CATGCGCATCAGGTTGTAGACCGACAAGGCCACCAGAAACACCACGAACC 


950 


951 


CGATGCGCATCACCTGAGCGTCGATCCCCACCGCCCAGATCGAACCGAGC 


1000 


1001 


CAGGCAAAGCAGAACCCCATGGAGGCCAACGGCAGCGCGTGGCGCAACTC 


1050 


1051 


GATGCGATTACGTTGGTGATAACGCCATAACGCCAGCATCACGTTCGGCA 


1100 


1101 


CCACCATGACCAGAGCTGTGCCCTGGGCAAGCTGCTGATCCAGGCCAAAT 


1150 


1151 


AACACGCCCAGAGCGGGAATGGCGATCAAGCCACCGCCGATTCCAAACAA 


1200 


1201 


ACCACCTACGGTCCCCAAGGCTGCACCGAGCAGCAGGTACATCGTCAACT 


1250 


1251 


CAATCACAGGTCAAATTCCCTCACGTCAATGGGTGCATCCTACGCAGTCG 


1300 
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1301 GGGCTAGCGGGGAAACGCACAGCAACGCACAATGGCTATGCCAAATTCGC 1350 

1351 ACAAGC AATCCTCCATGAACCCCACAACGCTCACCG ATCAATTGGGTCTA 1400 

1401 TTCCTTGATGTCGTGGAAACCGGCAGTTTTTCCGCTGCGTCCCGACGCCA 14 50 

1451 TCCGCTGACACCCTCCGCCGTCGCCAGGCGTATCGATAGTCTGGAACAGG 1500 

1501 CGCTCGACAGCCAATTGTTCGTGCGCACCACTCATGCTGTGCGTCCTACG 1550 



1551 


CCAGCGGGATTGGCGTTTGCCGAGCGAGCCCGACGCATTGTCGGCGAGTT 


1600 


1601 


GCGCCTGGCGCGGGCCGAGGTCGCCTCCCTGAGCAGCGCGCCTGAAGGAC 


1650 


1651 


TGATTCGGGTCGACGCCCCCGCAGCCTTCGGCCGCAGGCACCTGGCGCCG 


1700 


1701 


GTCATCTATGACTACGACTTCGCCGCCTCCGGCGTGCCCGGGCTGCGGGG 


1750 


1751 


CCGGTTGCGCTACCTGCGCGGCGACAACATCGAGTTGAAAGCCTTCAACG 


1800 


1801 


CCGAAGACCGCAAGGAGCGCGAGTTCCAGATGGAGCTGGGCTACGTGGTG 


1850 


1851 


CAAAGCGGTCCGCTGAAAAACGTCGGCCTGGTGGCGCGCAAGGCAATCTA 


1900 


1901 


CCGCAATGACTTCCCCACTGGCGCCGCCTTCCGCGATGAAAACCAGACGC 


1950 


1951 


GGTTTCTGGTGACCTATACCTTGCCGATCTGGTGAGTGCGCGTGTTGCGG 


2000 


2001 


TGGGGCTGATGGCCCCATCGCGAGCGGGCTCGCTCCTACAGTGGGTTTGG 


2050 


2051 


TGTTAATCACAGAGGCTGTGGAGCTTGCAGCCCCTGTAGGCGCTGGCTTG 


2100 


2101 


CCAGCGAGGCGTAGGCACTGCTGGCGCAAGGCTCAAGGCCCCACAGGCCC 


2150 


2151 


GCTCCCACCOTTCAGATTTTCTATTCOTGATAAATCTTCTTCAGCAGCCG 


2200 


2201 


CAGCAGCTCGTCGCGTTCCTGGTCGTCCAGTGCCGAGGTGGCGTCGAGGT 


2250 


2251 


CGCTTTGGGCGGCGATCTGGTTCAGTTCCTTGAGCAGGGTCTCGCCGGTC 


2300 


2301 


TTGCTGAGGAATATCCCGTACGAGCGCTTGTCCGGCTTGCAGCGCACACG 


2350 


2351 


CACCGCCAGCGCCCGGCTTTCCAGCTTATTCAGCAGCGGTACCACCTGGG 


2400 


2401 


GCGGCTCGATGCTCAGAGCCCGGGCCAGGTCGGCCTGCATCAGGCCGGGG 


2450 


2451 


TTCTGATTGATGATCGCCAGCGCCGAGAATTGCGCGGGGCGCAGATCGTG 


2500 


2501 


GGCCGAGAGGCGGCTGATCAGGTTCTGGAACAGTTTCAGTTGCGCACGGC 


2550 


2551 


GCATGGCGTAGCCGATCAGATCATTCAGCGCCGAATCCATGGGCGCCTGG 


2600 


2601 


GTCTCGGCGGGAGTCGACGCAGCCTCGACCGACTCGGCGAGGGGGGAGGG 


2650 


265i 


CTTGGCCATTGCGGGGAAGTCCTGAAGATGGAGGTTAACAAGACTATCTA 


2700 


2701 


GTTTGCCGACCTTGGCCGGTGATTGCTACGGCCAATATCGCTCGGCGCCA 


2750 
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2751 AGACCGACCAGTCCATCACCTGCGAGAAAATTGGTTAAATCAATTAATAG 2800 

2801 TTAATTGACATAACTAATTCGCTGCTTTAATTTCGAGTCATCTTCAAAAC 2850 

2851 CCAGAACAAGAGAGCATCGCCATGAGCACATACGAAGGTCGCTGGAAAAC 2900 

Met SerThrTyrGluGlyArgTrpLy sTh 

2901 GGTCAAGGTCGAAATCGAAGACGGCATCGCGTTTGTCATCCTCAATCGCC 2950 
rValLysValGluIleGluAspGlylleAlaPheVallleLeuAsnArgP 

2951 CGGAAAAACGCAACGCGATGAGCCCGACCCTGAACCGCGAGATGATCGAT 3000 
roGluLysArgAsnAlaMetSerProThrLeuAsnArgGluMetlleAsp 

3001 GTTCTGGAAACCCTCGAGCAGGACCCTGCCGCCGGTGTGCTGGTGCTGAC 3050 
ValLeuGluThrLeuGluGlnAspProAlaAlaGlyValLeuValLeuTh 

3051 CGGTGCGGGCGAAGCCTGGACCGCAGGCATGGACCTCAAGGAATACTTCC 3100 
rGlyAlaGlyGluAlaTrpThrAlaGlyMetAspLeuLysGluTyrPheA 

3101 GCGAAGTGGACGCCGGCCCGGAAATCCTCCAGGAAAAAATCCGCCGCGAA 3150 
rgGluValAspAlaGlyProGluIleLeuGlnGluLysIleArgArgGlu 

3151 GCCTCGCAATGGCAATGGAAACTGCTGCGCATGTACGCCAAGCCGACCAT 3200 
AlaSerGlnTrpGlnTrpLysLeuLeuArgMetTyrAlaLysProThrll 

3201 CGCCATGGTCAATGGCTGGTGCTTCGGCGGCGGTTTCAGCCCGCTGGTGG 3250 
eAlaMetValAsnGlyTrpCysPheGlyGlyGlyPheSerProLeuValA 

3251 CCTGCGACCTGGCGATCTGCGCCGACGAAGCAACCTTCGGTCTCTCGGAA 3300 
laCysAspLeuAlalleCysAlaAspGluAlaThrPheGlyLeuSerClu 

3301 ATCAACTGGGGTATCCCGCCGGGCAACCTGGTGAGCAAGGCCATGGCCGA 3350 
IleAsnTrpGlylleProProGlyAsnLeuValSerLysAlaMetAlaAs 

3351 CACCGTGGGCCACCGCCAGTCGCTCTACTACATCATGACCGGCAAGACCT 3400 
pThrValGlyHisArgGlnSerLeuTyrTyrlleMetThrGlyLysThrP 

3401 TCGGTGGGCAGAAAGCCGCCGAGATGGGCCTGGTCAACGAAAGCGTGCCC 3450 
heGlyGlyGlnLysAlaAlaGluMetGlyLeuValAsnGluSerValPro 

3451 CTGGCGCAACTGCGCGAAGTCACCATCGAGCTGGCGCGTAACCTGCTCGA 3500 
LeuAlaGlnLeuArgGluValThrlleGluLeuAlaArgAsnLeuLeuGl 

3501 AAAAAACCCGGTGGTGCTGCGTGCCGCCAAACACGGTTTCAAACGCTGCC 
uLysAsnProValValLeuArgAlaAlaLysHisGlyPheLysArgCysA 



3550 



3551 GCGAACTGACCTGGGAGCAGAACGAGGATTACCTGTACGCCAAGCTCGAT 3600 
rgGluLeuThrTrpGluGlnAsnGluAspTyrLeuTyrAlaLysLeuAsp 

360] CAGTCGCGTTTGCTGGACACCGAAGGCGGTCGCGAGCAGGGCATGAAGCA 3650 
GlnSerArgLeuLeuAspThrGluGlyGlyArgGluGlnGlyMetLysGl 

3651 ATTCCTCGACGACAAGAGCATCAAGCCTGGCCTGCAAGCGTATAAACGCT 3700 
nPheLeuAspAspLysSerlleLysProGlyLeuGlnAlaTyrLysArgE 

37 01 GAAGGACGACGCTGCGGGCGCATTGCGCGAAGGCGAGTGCGCCCTGAAGC 3750 
nd 
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3800 



3801 ACCATGCTGGACGTGCCCCTGCTGATTGGCGGCCAGTCGTGCCCCGCGCG 3850 
MetLeuAspValProLeuLeuIleGlyGlyGlnSerCysProAlaAr 

3851 oA^T^T CCAGCOT ^ CCCGGTGAC ^ CGAG ^TGT 3900 
gAspGlyArgThrPheGluArgArgAsnProValThrGlyGluLeuValS 

3901 CGCGGGTTGCCOrCGCCACCCTGC^GATGCCGACGCCGCCGT^CCGCT 3950 
erArgValAlaAlaAlaThrLeuGluAspAlaAspAlaAlaVa^SA^ 

3951 



AlaGlnGlnAlaPheProAlaTrpAlaAlaLeuAlaProAsnGluArgAr 

4001 4050 
gSerArgLeuLeuLysAlaAlaGluGlnLeuGlnAlaArgSerGlyGluP 

4051 TCATCGAGGCGGCGGGCGAGACCGGCGCCATGGCCAACTGGTACGGGTTC 4100 
helleGluAlaAlaGlyGluThrGlyAlaMetAlaAsnTrpTyrt^Phe 

4101 A^n^^A^^^i^^^^^^^ G ^^^^^^^^^^^*^ 4150 
AsnValArgLeuAlaAlaAsnMetLeuArgGluAlaAlaSerMetThrTh 

4151 CCAGGTCAATGGTGAAGTCAITCCCT^^ 

rGXaValAsnGlyGluVallleProSerAspValProGlySe^SS 

4201 4250 

etAlaLeuArgGlnProCysGlyValValLeuGlylleAlaProTrpAsn 

4251 



4301 CAACACCGTGGTGCTGAAGGCTTCCGAGCTGAGTCCGGCGGTGCATCGTT 4,™ 
yAsnThrValValLeuLysAlaSerGluLeuSerPr^aTSSS 

4351 TGATCGGCCAGGTGCTGCAGGACGCCGGCCTGGGCGATGGCGTGGTCAAr 
ealleGlyGlnValLeuGlnAspAlaGlyLeuGlyAsSSvI^n 

4401 GTCATCAGTAATGCGCCGGCGGATGCGGCACAGATTGTCGAGCGCCTGAT 44 50 
VallleSerAsnAlaProAlaAspAlaAlaGlnlleValGluArgSun 

4451 TGCCAACCCGGCCGTACGCCGGGTCAATTTCACCGGTTCGACCCACGTCG <mnn 
eAlaAsnProAlaValArgArgValAsnPheThrGlysShrSsvIS 

4501 GGCGCATTGTCGGCGAGCTCTCGGCGCGCCACCTCAAACCGGCGTTGCTC 4«n 
IvArglleValGlyGluLeuSerAlaArgHisLeuLysP^allu^ ^ 



4551 G^S"^ 4600 
uAlaAlaValGlnAlaAlaAlaPheG'^Ia^P^^G^GS " 5 ° 



GluLeuGlyGlyLysAlaProLeuLeuValLeuAspAspAlaAspL^ 
4601 GGC ™ C ^^ 



4651 1^1^^^ 4700 
leCysMetSerThrGluArgLeuneValAspAlaLysValAlaAspAie 
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4701 TTTGTCGCCCAGTTGGCGGCCAAGGTCGAGACCCTGCGCGCCGGTGATCC 4750 
PheValAlaGlnLeuAlaAlaLysValGluThrLeuArgAlaGlyAspPr 

4751 TGCCGACCCGGAGTCGGTGCTCGGTTCGCTGGTGGACGCCAGCGCTGGCA 



— ™ w * * w * vjv— x \. w 1 1 ^vju i uu iTJUALGCCAGCGCTGGCA 

oAlaAspProGluSerValLeuGlySerLeuValAspAlaSerAlaGlyT 

CGCGGATCAAAGCGTTGATCGATGATGCCGTGGCCAAGGGCGCGCGCCTG 
hrArglleLysAlaLeuIleAspAspAlaValAlaLysGlyAlaArgLe 



4850 

Leu 

4851 GTAATCGGCGGGCAACTGGAGGGCAGCATCTTGCAGCCGACCCTGCTCGA 4900 
ValIleGlvRlvf51nt.onr:l ,.r!T.,o T i_r , _ _. _ 



VallleGlyGlyGlnLeuGluGlySerlleLeuGlnProThrLeuLeuAs 

CGGTGTCGACGCGAGCATGCGTTTGTACCGCGAAGAGTCCTTCGGCCCGG 
pGlyValAspAlaSerMetArgLeuTyrArgGluGluSerPheGlyPr 



4901 „_ :cQG 4gso 

ProV 

4951 TGGCGGTGGTGCTGCGCGGCGAGGGCGAAGAAGCGCTGTTGCAACTGGCC 5000 
alAlaValValLeuArgGlyGluGlyGluGluAlaLeuLeuGlnLeuAla 

5001 AACGACTCCGAGTTCGGTTTGTCGGCGGCGATTTTCAGTCGTGACACCGG 5050 
AsnAspSerGluPheGlyLeuSerAlaAlallePheSerArgAspThrGl 

5051 CCGTGCCCTG(XX:CTGGCCCAGCGGGTCGAATCGGGCATCTGCCACATCA 5100 
yArgAlaLeuAlaLeuAlaGlnArgValGluSerGlylleCysHisIleA 

5101 ACGGCCCGACCGTGCACGACGAAGCGCAAATGCCTTTTGGCGGGGTCAAG 5150 
snGlyProThrValHisAspGluAlaGlnMetProPheGlyGlyValLys 

5151 TCCAGCGGCTACGGCAGTTTTGGCGGCAAGGCATCGATTGAGCATTTCAC 5200 
SerSerGlyTyrGlySerPheGlyGlyLysAlaSerlleGluHisPheTh 

5201 TCAGTTGCGCTGGGTCACCCTCCAGAATGGTCCACGGCACTATCCGATCT 5250 
rGlnLeuArgTrpValThrLeuGlnAsnGlyProArgHisTyrProIleE 

5251 GA 5252 
nd 
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Nucleotide sequence of p FfQO I (I 8 Kb EcoRI/PstI frq gment ex pFI9711 

1 GAATTCGGGA TCTGGGCTGC CAACCAGTTG GAAGAAAAGA TTCTCGAACT 
51 CGGTGTCGAC AACGTCGGCG CCTTCATTGC CGAGCCGATC CAGGGCGCCG 
101 GCGGCGTGAT CGTGCCGCCA GAAAGCTACT GGCCGCGCAT CAAGGAAATC 
151 CTCGCCAAGT ACGACATCCT GTTCGTCGCC GATGAAGTGA TTTGCGGTTT 
201 CGGCCGTACC GGCGAGTGGT TCGGCAGCGA TTTCTACGAC CTCAAGCCCG 
251 ACATGATGAC CATCGCCAAG GGCCTGACTT CCGGCTACAT CCCGATGGCT 
301 GGTCTGATCG TGCGCGATTC GGTGCTCGAA GTGCTGAACG AAGGCGGCGA 
351 TTTCAACCAC GGATTCACCT ACTCCGGTCA CCCGGTAGCG GCGGCTGTTG 
401 CCCTGGAAAA CATCCGCATC ATGCGCGAAG AGAAGATTAT CGAGCGCGTC 
451 CAGGAAGAAA CGGCACCGTA TTTGCAAAAG CGTCTGCGTG AACTCAACGA 
501 TCATCCATTG GTGGGTGAAG TTCGCGGGGT AGGGTTGCTG GGCGCTATCG 
551 AACTGGTTCA GGACAAAGCC ACGCGCAAAC GTTACGAAGG CAAGGGCGTG 
601 CGCATGATCT GCCGGCAGTT CTGCTTCGAC AACGGGCTGA TCATGCGCGC 
651 GGTTGGCGAC ACCATGATCA TCGCGCCGCC ACTGGTGATT ACCAAGGCGG 
701 AAATCGATGA GCTGGTGAGC AAGGCACGCA AGTGCCTGGA CCTGACCCTG 
751 AGTGTGTTGC AGGGCTAAGT GCTAGGCTCT GAGCGGGAGT TGTATGAACT 
801 TTCGCTCAGA GCGGTCAGAA AGCTTGGCCT TTCCTTGAAA GACCGCCATC 
851 GATGTTGCCA GACTAGCCAC CGTTCCAAAT GCCCGGGTTC GGCGCGGAAC 
901 AGGTGGTTCA AAAAAGCAAA AATTTGGAGC ATTACGCATG AAGGCACTCG 
951 GTAAAAAGCT CGCCGGCAAG ACACTCCTTG CCATCTCCCT GATGGGCATC 
1001 ATGGCGGGCG CGGTTCAGGC AGATGACAAA GTCTTGCACG TGTACAACTG 
1051 GTCCGATTAC ATCGCGCCGG ACACCATCAA GAAGTTTGAA GACGAGTCGG 
HOI GCATCAAGGT GGTCTACGAC GTCTTCGACA GTAACGAAAC CCTCGAAGCC 
1151 AAGTTGCTGG CCGGCAAGTC CGGTTACGAC ATCGTGGTGC CTTCGAACAA 
1201 CTTCCTGGCC AAGCAGATCA AGGCCGGCGT CTACCAGAAG CTGGACAAGT 
1251 CCAACCTGCC GAACTGGAAG AACCTCAACA CCGATCTGCT CAAGGCCGTT 
HO, TCGGTCAGCG ACCCTGGTAA CGAGCACGCC TTCCCGTACA TGTGCGGCT' ' 
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1351 GATCGGCATC GGCTTCAACG CCGAGAAGGT CAAGGCCGCG CTGGGTCCGG 
14 01 ATGCACCGAC CAATTCCTGG GACCTGATCT TCAAACCGGA AAACGCCGCC 

14 51 AAGCTGAAAT CCTGTGGCAT CAGCGTGCTG GATTCGCCAA CCGAGATGAT 

1501 TCCGGTGGCC CTGCACTACC TGGGCTACCC GACCGACAGC CAGGACAAGA 

1551 AACAACTGGC CGAGGCCGAG GCACTGTTCC TCAAAGTTCG TCCTTCGATC 

1601 GGTTACTTCC ACTCCTCCAA GTACATTTCC GACCTGGCCA ACGGCAACAT 

L651 CTGCGTGGCG ATCGGCTACT CGGGTGACAT CTATCAGGCC AAGACTCGCG 

L701 CCGCCGAAGC CGGTGACAAG GTCAAGGTCA GCTACAACAT TCCCAAAGAA 

751 GGTGCAGGCA GCTTCTACGA CATGGTCGCC ATCCCTAAAG ATGCCGAAAA 

801 CGTCGAAGGC GCCTACAAGT TCATGACCTT CCTGCAG 
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Nucleotide sequence pFI91 1 f850bp EcoRI/PstI fragment cx pFI973) 

1 CTGCAGACCT TCTGCCAGGC GCACCGGCTC ACGCAGGGTT TTGACTTCCT 

51 GGATCATCAG GCGCTGGTTG CGTTTGACCG ACTGGCCAAT CATCAGGCGA 

101 AACGCATTGG AGCCCAGGTC GATAGCGGCG AATAGCGATG CGTCTTCTTT 

151 CACGTGAGGA ACTCCTGGCA ACTTCGTCCG CCGAGGGCAA AAAACCGCTT 

201 TTGCCGATCC TGCACGGGGT AGATGACATC AGGATGACAT TGGAAATTTT 

251 TCTGACAGAC GTTTCGTCAC CAGAACGTCA CAGTCGCGGG GCTAGCATCG 

301 GGGCTTCCAA TCCGGTCGGG AGCCTTGAAC ATGCTGTTAA CCAACGACAC 

351 CCTGATGCAT CGCATCCACC GCGAGTTGCT CGACCACAGT GACGAAGAGC 

401 TGGAACTGGA GTTGCTGGAA GACGATCACG ACCTGGCTTC GCTGTTCGCG 

451 GATCAACCGG GCGATACCCC GGCCAAGGCC GAGCGCCGTC GTTACTTCAG 

501 CGAGTTGTTC CGTTTGCAGG GCGAGTTGGT CAAGTTGCAA AGCTGGGTGG 

551 TGAAGACCGG GCACAAGGTG GTGATTCTGT TCGAAGGCCG CGATGCCGCT 

601 GGCAAAGGGG GCGTGATCAA GCGCATCACC CAGCGTCTTA ATCCACGGGT 

651 CTGCCGGGTC GCGGCGCTTC CCGCGCCGAG TGACCGCGAG CGCACCCAGT 

701 GGTATTTCCA GCGTTATGTC TCGCACCTGC CCGCCGCCGG CGAGATCGTC 

751 CTGTTCGACC GCAGCTGGTA CAACCGCGCC GGTGTCGAGC AGGTGATGGG 

801 CTTTTGCAAC GAGGAACAGT ACGAAGAATT TTTCCGCAGC GTGCCGGAAT 

851 TC 
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Nucleotide sequence P FI912 f95 «b p EcoRJ/PstI fragment ex pR793) 

1 CTGCAGGGCC TGGGGCATGC CGAGTCGGCG TCGCAGAACG CCTCTGCCTA 

51 TGCGCTGGAA CGCAAGCAAA TGCGTGCGCC CGCTCGCCCG GTCGGAGTCG 

101 AAGCCGAAGT GGCCGACCCC ATTCATTTTC ATCCGGCCAT GCGCCGGGTG 

151 TTGCTGGAAC TGAGGGCCTA TGCCGAGGGC ATGCGTGCGG TCCGTTACTG 

201 GGCGGCGCAT TTGTTGGATC AGTCCGAGCA GGCCGAGGAT CTGCCCACTC 

251 GTCAGCGCGC CTTGCAACTG GCGGAGCTGC TGACGCCGGT GATCAAGGCG 

301 TTCTTCACCG AGCAGGGTTT TCGCCTGGCC AGCAACGCCT TGCAGGTGTT 

351 CGGTGGCTAC GGCTACGTCA GCGAGTTCGC CATCGAACAG ACCCTGCGCC 

401 ACAGCCGGAT CGCGATGATT TACGAGGGCA GCAACGAAAT CCAGGCCAAT 

451 GACCTGCTGC TGCGCAAAGT GCTGGGGGAT GAAGGTCGCG CCTTTGGCCA 

501 ACTGTTGGCG GTCATGCGCG AAGAGGCCGA ACTGGCCTGT AACGACACCC 

551 GCTTTGGCGC TGAGCTGGTG CAGCTGTGCG ACAAACTCGA GACAGTGCAA 

601 CTTGAGATAG GGGACCTCGC CGTCACGGAG CGCGAATACC CGTATCGAGC 

651 CGCTGGCGAT TTCCTGCGCC TGTGTGGCGT GGCGCTGTTG GGGTTTTCCT 

701 GGGCGAGAGC GGCACGGGTG TCTCGCCTGT TACCTGACAG CGATCCACTG 

751 CGTCCCAACA AACTGGAAAC CGCGCGTTTC TTCTTTGCCT ACCTGCTGCC 

801 AGAAGCCGAT CAACGCCTCG CAGCCATTCG GGCGGCGAGA GCGCCGTTGC 

851 CGTTTTTGAT CTGAAAAAAC GCCCGCCAGG CCCAATGTGG CTCGCTCCCA 

901 CAAACAGCGC GAACCACATC GAGCCACCGC CGCCACGCCA GTTGTACAGG 

951 CCGAATTC 
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Nucleotide sequence pFI9I3 f959h p EcoRr/Pstr fragment ex pFI793) 

1 CTGCAGGCTT GCCATATCAG TGGCGACAGC TTCGTCCGCG CTCCAGGCGC 
51 AAGGGCCAAC GACCTGCCGA CGGTGCCGAA TAGGCTGTCG GCGTCCGTTA 
101 TTCTGGACGC ACCGCAAAAA CTGTTATTTA CCCGGTCTTC TTCCACTGTA 
151 GAACCTTTTC ACTATAGCGG CCCTGCGTGT TCTGCGGGAG CTGCTCATGA 
201 TTCTGCACGC GATTCCACTT CCAGCCCGTT GCCGCGCCGT GCTGTTGCGG 
251 TTTCTGCACG CACGGCTTTT GCATCAGGCT TGCACAGCCA GCCACAAGGG 
301 CAGGTAAGCT CTAGCTCGCA CGTCCTGGGC GTCTCCCAGG TCTGCCAACG 
351 CGACGCGGAC GCGTCAAACA ACGCCCGGCC CCTAATGAAG CCGGGACACT 
401 CAGCCCAGAG GCATTTATGA GTAACAACCT CGACCAGCTC ACCGATTGGT 
451 TGAAAGACCA CAAGATCACA GAAGTCGAAT GCATGATTGG CGACTTGACC 
501 GGGATCACCC GCGGCAAGAT CTCGCCAACC AACAAGTTCA TTGCCGAAAA 
551 AGGCATGCGC CTGCCCGAGA GTGTGCTGTT GCAGACAGTG ACGGGCGACT 
601 ATGTCGAAGA CGACATCTAT TACGAACTGC TCGACCCGGC CGACATCGAC 
651 ATGATCTGCC GCCCCGACCA GAACGCGGTG TTCCTCGTGC CATGGGCCAT 
701 CGAGCCGACC GCGCAGGTGA TTCACGACAC CTACGACAAG CAGGGCAACC 
751 CGATCGAGCT GTCGCCACGC AACGTCCTCA AGAAAGTCCT CAAACTCTAT 
801 TCCGACAAGG GCTGGCAGCC GATCGTGGCG CCGGAAATGG AGTTCTACCT 
851 GACCAAGCGC AGTGACGACC CGGATTACCC ATTGCAACCG CCGGTTGGCC 
901 GTTCCGGACG TCCGGAAATC GGTCGCCAAT CGTTCTCTAT CGAAGCGGCC 
951 AACGAATTC 
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Outward reading POR p rimprc 



PFI901 



pFI911 



pFI912: 



pFI913: 



P35 

P36 

P32 

P37 

P33 

P38 

P34 
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Nucle otide sequence merged contigs pFI91 1/PCR p rodiir.t/ pFI9QI f42_59hp ) 

1 CTGCAGGCTT GCCATATCAG TGGCGACAGC TTCGTCCGCG CTCCAGGCGC 
51 AAGGGCCAAC GACCTGCCGA CGGTGCCGAA TAGGCTGTCG GCGTCCGTTA 
101 TTCTGGACGC ACCGCAAAAA CTGTTATTTA CCCGGTCTTC TTCCACTGTA 
151 GAACCTTTTC ACTATAGCGG CCCTGCGTGT TCTGCGGGAG CTGCTCATGA 
201 TTCTGCACGC GATTCCACTT CCAGCCCGTT GCCGCGCCGT GCTGTTGCGG 
251 TTTCTGCACG CACGGCTTTT GCATCAGGCT TGCACAGCCA GCCACAAGCG 
301 CAGGTAAGCT CTAGCTCGCA CGTCCTGGGC GTCTCCCAGG TCTGCCAACG 
351 CGACGCGGAC GCGTCAAACA ACGCCCGGCC CCTAATCAAG CCGGGACACT 
401 CAGCCCAGAG GCATTTATGA GTAACAACCT CGACCAGCTC ACCGATTGGT 
451 TGAAAGACCA CAAGATCACA GAAGTCGAAT GCATGATTGG CGACTTGACC 
501 GGGATCACCC GCGGCAAGAT CTCGCCAACC AACAAGTTCA TTGCCGAAAA 
551 AGGCATGCGC CTGCCCGAGA GTGTGCTGTT GCAGACAGTG ACGGGCGACT 
601 ATGTCGAAGA CGACATCTAT TACGAACTGC TCGACCCGGC CGACATCGAC 
651 ATGATCTGCC GCCCCGACCA GAACGCGGTG TTCCTCGTGC CATGGGCCAT 
701 CGAGCCGACC GCGCAGGTGA TTCACGACAC CTACGACAAG CAGGGCAACC 
751 CGATCGAGCT GTCGCCACGC AACGTCCTCA AGAAAGTCCT CAAACTCTAT 
801 TCCGACAAGG GCTGGCAGCC GATCGTGGCG CCGGAAATGG AGTTCTACCT 
851 GACCAAGCGC AGTGACGACC CGGATTACCC ATTGCAACCG CCGGTTGGCC 
901 GTTCCGGACG TCCGGAAATC GGTCGCCAAT CGTTCTCTAT CGAAGCGGCC 
951 AACGAATTCG ACCCGCTGTT CGAAGACGTC TACGACTGGT GCGAACTGCA 
1001 GGAGCTGGAT CTCGATACGC TGATCCACGA AGACGGCACG GCGCAGATGG 
1051 AAATCAACTT CCGTCACGGC GACGCGCTGT CCCTCGCCGA CCAGATCCTG 
1101 GTGTTCAAGC GCACCATGCG CGAGGCCGCG CTCAAGCACA ACGTGGCCGC 
U5 1 CACGTTCATG GCCAAGCCGA TGACCGGCGA GCCTGGCAGC GCCATGCACC 
1201 TGCACCAGAG CATCATCGAT ATCGAGACCG GCAAGAACGT CTTCTCCAAT 
1251 GAAGACGCGA GCATGAGCCA GTTGTTCCTC AACCACATCG GCGGCCTGCA 
130) CAAATPCATC CCTGAACTCC TCCCGCTGTT CCCCCCCAAC frrCAACTCCT 
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1351 TCCGCCGCTT CCTGCCGGAC ACTTCGGCGC CGGTGAACGT CGACTGGGGC 
HOI GAAGAAAACC GTACCGTGGG CCTGCGGGTG CCGGATGCCG GCCCTCAAAA 
H5! CCGTCGGGTG GAAAACCGCC TGCCGGGTGC CGACGCCAAC CCGTACCTGG 
1501 CGATTGCCGC GAGCCTGCTG TGCGGCTACA TCGGCATGGT CGAAGGTATC 
1551 AACCCAAGCG CGCCTGTGGT GGGTCGTGGT TACGAGCGGC GCAACCTGCG 
1601 TCTGCCGCTG ACCATCGAAG ACGCTCTGGA ACGCATGGAA AACAGCAAGA 
1651 CCATCGAGAA ATACCTGGGT CACAACTTCA TCACTGGCTA CGTCGCGGTC 
1701 AAGCGGGCCG AGCATGAAAA CTTCAAGCGC GTGATCAGCT CATGGGAACG 
1751 GGAATTCCTG TTGTTCGCCG TCTGACACGC CGGGTGCGGC CCTCAAAAGC 
1801 CGCACTCCAA CCTCACTAGG AGAGCTTTAT GAGCAACAAC CCGCAAACCC 
1851 GTGAATGGC A GAACCTGAGC GCCGAACACC ACCTGGCCCC CTTCAGTGAC 
1901 TTCAAGCAAT TGAAGGAAAA AGGCCCGCGC GTCATCACCA GCGCCAAGGG 
1951 CGTTTACCTG TGGGACAGCG AAGGCAATCA GATCCTCGAC GGCATGGCCG 
2001 GCCTGTGGTG CGTGGCCATC GGTTACGGCC GCGACGAGTT GGCCGAGGCT 
2051 GCCAGCAAGC AGATGCGCGA GTTGCCGTAC TACAACCTGT TTTTCCAGAC 
2101 CGCTCACCCG CCCGTCCTCG AGCTGGCCAA GGCAATTTCC GATATCGCGC 
2151 CAGCAGGCAT GAACCACGTG TTCTTCttCC GTTCCGGCTC CGAAGGCAAT 
2201 GACACCATGC TGCGCATGGT TCGCCACTAC TGGGCGATCA AAGGTCAGCC 
2251 AAACAAGAAA GTCATTATCA GCCGCAAGAA CGGCTACCAC GGTTCGACCG 
2301 TGCCCGGCGC CAGCCTGGGC GGCATGACCT ACATGCACGA ACAGGGCGAC 
2351 TTGCCGATCC CGGGCATCGT GCACATTCCG CAGCCGTACT GGTTCGGTGA 
2401 AGGCGGCGAC ATGACCCCGG AAGAATTCGG GATCTGGGCT GCCAACCAGT 
2451 TGGAAGAAAA GATTCTCGAA GTCGGTGTCG ACAACGTCGG CGCCTTCATT 
2501 GCCGAGCCGA TCCAGGGCGC CGGCGGCGTG ATCGTGCCGC CAGAAAGCTA 
2551 CTGGCCGCGC ATCAAGGAAA TCCTCGCCAA GTACGACATC CTGTTCGTCG 
2601 CCGATGAAGT GATTTGCGGT TTCGGCCGTA CCGGCGAGTG GTTCGGCAGC 
2651 GATTTCTACG ACCTCAAGCC CGACATGATG ACCATCGCCA AGGGCCTGAC 
2701 TTCCGCCTAC ATCCCGATGG GTGGTCTGAT CGTGCGCGAT TCGGTGGTCG 
Ml A AGTG CTG AA CGAACGCGCC GATTTCAACC ACGCATTCAC CTACTCCCGT 
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2801 CACCCGGTAG CGG6GGCTGT TGCCCTGGAA AACATCCGCA TCATGCGCGA 
2851 AGAGAAGATT ATCGAGCGCG TCCAGGAAGA AACGGCACCG TATTTGCAAA 
2901 AGCGTCTGCG TGAACTCAAC GATCATCCAT TGGTGGGTGA AGTTCGCGGG 
2951 GTAGGGTTGC TGGGCGCTAT CGAACTGGTT CAGGACAAAG CCACGCGCAA 
3001 ACGTTACGAA GGCAAGGGCG TGGGCATGAT CTGCCGGCAG TTCTGCTTCG 
3051 ACAACGGGCT GATCATGCGC GCGGTTGGCG ACACCATGAT CATCGCGCCG 
3101 CCACTGGTGA TTACCAAGGC GGAAATCGAT GAGCTGGTGA GCAAGGCACG 
3151 CAAGTGCCTG GACCTGACCC TGAGTGTGTT GCAGGGCTAA GTGCTAGGCT 
3201 CTGAGCGGGA GTTGTATGAA CTTTCGCTCA GAGCGGTCAG AAAGCTTGGC 
3251 CTTTCCTTGA AAGACCGCCA TGGATGTTGC CAGACTAGCC ACCGTTCCAA 
3 301 ATGCCCGGGT TCGGCGCGGA ACAGGTGGTT CAAAAAAGCA AAAATTTGGA 
3351 GCATTACGCA TGAAGGCACT CGGTAAAAAG CTCGCCGGCA AGACACTCCT 
3401 TGCCATGTCC CTGATGGGCA TCATGGCGGG CGCGGTTCAG GCAGATCACA 
3451 AAGTCTTGCA CGTGTACAAC TGGTCCGATT ACATCGCGCC GGACACCATC 
3501 AAGAAGTTTG AAGACGAGTC GGGCATCAAG GTGGTCTACG ACGTCTTCGA 
3S51 CAGTAACGAA ACCCTCGAAG CCAAGTTGCT GGCCGGCAAG TCCGGTTACG 
3601 ACATCGTGGT GCCTTCGAAC AACTTCCTGG CCAAGCAGAT CAAGGCCGGC 
3651 GTCTACCAGA AGCTGGACAA GTCCAAGCTG CCGAACTGGA AGAACCTGAA 
3701 CACCGATCTG CTCAAGGCCG TTTCGGTCAG CGACCCTGGT AACGAGCACG 
3751 CCTTCCCGTA CATGTGGGGC TCGATCGGCA TCGGCTTCAA CGCCGAGAAG 
3801 CTCAAGGCCG CGCTGGGTCC GGATGCACCG ACCAATTCCT GGGACCTGAT 
3851 CTTCAAACCG GAAAACGCCG CCAAGCTGAA ATCCTGTGGC ATCAGCGTGC 
3901 TGGATTCGCC AACCGAGATG ATTCCGGTGG CCCTGCACTA CCTGGGCTAC 
3951 CCGACCGACA GCCAGGACAA GAAACAACTG GCCGAGGCCG AGGCACTGTT 
4001 CCTCAAAGTT CGTCCTTCGA TCGGTTACTT CCACTCCTCC AAGTACATTT 
4051 CCGACCTGGC CAACGGCAAC ATCTGCGTGG CGATCGGCTA CTCGGGTGAC 
4 10! ATCTATCAGG CCAAGACTCG CGCCGCCGAA GCCGGTGACA AGGTCAAGGT 
4 151 CAGCTACAAC ATTCCCAAAG AAGGTGCAGG CAGCTTCTAC GACATGGTCG 
4 201 CCATCCCTAA AGATOCrCAA AACGTCGAAf rrrrr-T»r«» 
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